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(57) Abstract 

Disclosed is a nucleotide sequence encoding an effective portion of a class A starch branching enzyme (SBE) obtainable from potato 
plants, or a functional equivalent thereof, together with, inter alia, a corresponding polypeptide, a method of altering the characteristics of 
a plant, a plant having altered characteristics; and starch, particularly starch obtained from a potato plant, having novel properties. 
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Title: Improvements in or Relating to Plant Starch Composition 

Field of the Invention 

This invention relates to novel nucleotide sequences, polypeptides encoded thereby, vectors 
and host cells and host organisms comprising one or more of the novel sequences, and to 
a method of altering one or more characteristics of an organism. The invention al;so 
relates to starch having novel properties and to uses thereof. 

Background of the Invention 

Starch is the major form of carbon reserve in plants, constituting 50% or more of the dry 
weight of many storage organs - e.g. tubers, seeds of cereals. Starch is used in numerous 
food and industrial applications. In many cases, however, it is necessary to modify the 
native starches, via chemical or physical means, in order to produce distinct properties to 
suit particular applications. It would be highly desirable to be able to produce starches 
with the required properties directly in the plant, thereby removing the need for additional 
modification. To achieve this via genetic engineering requires knowledge of the metabolic 
pathway of starch biosynthesis! This includes characterisation of genes and encoded gene 
products which catalyse the synthesis of starch. Knowledge about the regulation of starch 
biosynthesis raises the possibility of "re-programming" biosynthetic pathways to create 
starches with novel properties that could have new commercial applications. 

The commercially useful properties of starch derive from the ability of the native granular 
form to swell and absorb water upon suitable treatment. Usually heat is required to cause 
granules to swell in a process known as gelatinisation, which has been defined (W A 
Atwell et aU Cereal Foods World 55, 306-311, 1988) as "... the collapse (disruption) of 
molecular orders within the starch granule manifested in irreversible changes in properties 
such as granular swelling, native crystallite melting, loss of birefringence, and starch 
solubilisation. The point of initial gelatinisation and the range over which it occurs is 
governed by starch concentration, method of observation, granule type, and heterogeneities 
within the granule population under observation". A number of techniques are available 
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cited above) as "...a process which occurs when the molecules comprising gelatinised 
starch begin to reassociate in an ordered structure..,". At 50 °C, it is primarily the 
amyiose component which reassociates, as indicated by the increase in viscoamylograph 
viscosity for starch from normal maize (21.6% amyiose) compared with starch from waxy 
maize (1.1% amyiose) as shown in Figure 2. Figure 2 is a viscoamylograph of 10%w/w 
starch suspensions from waxy maize (solid line), conventional maize (dots and dashes), 
high amyiose variety (hylon 5. dotted line) and a very high amyiose variety (hylon 7, 
crosses). The temperature profile is also shown by a solid line, as in Figure 1. The 
extent of viscosity increase in the viscoamylograph on cooling and holding at 50 °C 
depends on the amount of amyiose which is able to reassociate due to its exudation from 
starch granules during the gelatinisation and pasting processes. A characteristic of 
amylose-rich starches from maize plants is that very little amyiose is exuded from granules 
by gelatinisation and pasting up to 95 °C, probably due to the restricted swelling of the 
granules. This is illustrated in Figure 2 which shows low viscosities for a high amyiose 
(44.9%) starch (Hylon 5) from maize during gelatinisation and pasting at 95 °C and little 
increase in viscosity on cooling and holding at 50 °C. This effect is more extreme for a 
higher amyiose content (58% , as in Hylon 7), which shows even lower viscosities in the 
viscoamylograph test (Figure 2). For commercially-available high amyiose starches 
(currently available from maize plants, such as those described above), processing at 
greater than 100°C is usually necessary in order to generate the benefits of high amyiose 
contents with respect to increased rates and strengths of reassociation, but use of such high 
temperatures is energetically unfavourable and costly. Accordingly, there is an unmet 
need for starches of high amyiose content which can be processed below 100°C and still 
show enhanced levels of reassociation, as indicated for example by viscoamylograph 
measurements. 

The properties of potato starch are useful in a variety of both food and non-food (paper, 
textiles, adhesives etc.) applications. However, for many applications, properties are not 
optimum and various chemical and physical modifications well known in the an are 
undertaken in order to improve useful properties. Two types of property manipulation 
which would be of use are: the controlled alteration of gelatinisation and pasting 
temperatures; and starches which suffer less granular fragmentation during pasting than 
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between the two classes. One general distinction of note would appear to be the presence, 
in class A SBE molecules, of a flexible N-terminal domain, which is not found in class 
B molecules. The distinctions noted by Burton et al. are relied on herein to define class 
A and class B SBE molecules, which terms are to be interpreted accordingly. 

However in potato, only one isoform of the SBE molecule (belonging to class B) has thus 
far been reported and only one gene cloned (Blennow & Johansson, 1991 Phytochem. 30, 
WI-MA, and Kofimann et aL, 1991 MoL Gen, Genet. 250, 39-44). Further, published 
attempts to modify the properties of starch in potato plants (by preventing expression of 
the single known SBE) have generally not succeeded (e.g. Miiller-Rober & KoBmann 1994 
Plant Cell and Environment 1 7, 601-613). 

Summary of the Invention 

In a first aspect the invention provides a nucleotide sequence encoding an effective portion 
of a class A starch branching enzyme (SBE) obtainable from potato plants. 

Preferably the nucleotide sequence encodes a polypeptide comprising an effective portion 
of the amino acid sequence shown in Figure 5 (excluding the sequence MNKRIDL, which 
does not represent part of the SBE molecule), or a functional equivalent thereof (which 
term is discussed below). The amino acid sequence shown in Figure 5 (Seq ID No. 15) 
includes a leader sequence which directs the polypeptide, when synthesised in potato cells, 
to the amyloplast. Those skilled in the an will recognise that the leader sequence is 
removed to produce a mature enzyme and that the leader sequence is therefore not 
essential for enzyme activity. Accordingly, an "effective portion" of the polypeptide is 
one which possesses sufficient SBE activity to complement the branching enzyme mutation 
in E. coli KV 832 ceils (described below) and which is active when expressed in E. coli 
in the phosphorylation stimulation assay. An example of an incomplete polypeptide which 
nevertheless constitutes an "effective portion" is the mature enzyme lacking the leader 
sequence. By analogy with the pea class A SBE sequence, the potato class A sequence 
shown in Figure 5 probably possesses a leader sequence of about 48 amino acid residues, 
such that the N terminal amino acid sequence is thought to commence around the glutamic 
acid residue (E) at position 49 (EKSSYN... etc.). Those skilled in the an will appreciate 
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sequence. Accordingly, functionally equivalent sequences will also include those 
sequences which can hybridise, under stringent hybridisation conditions, to the sequence 
of the invention (rather than the complement thereof). Such "antisense" equivalents will 
preferably possess at least 85%, more preferably at least 90%, and most preferably 95% 
sequence homology with the complement of the sequence of the invention as exemplified 
by nucleotides 289 to 2790 of the DNA sequence shown in Figure 5. Particular functional 
equivalents are shown, for example, in Figures 8 and 10 (if one disregards the various 
frameshift mutations noted therein). 

The invention also provides vectors, particularly expression vectors, comprising the 
nucleotide sequence of the invention. The vector will typically comprise a promoter and 
one or more regulatory signals of the type well known to those skilled in the art. The 
invention also includes provision of cells transformed (which term encompasses 
transduction and transfection) with a vector comprising the nucleotide sequence of the 
invention. 

The invention further provides a class A SBE polypeptide, obtainable from potato plants. 
In particular the invention provides the polypeptide in substantially pure form, especially 
in a form free from other plant-derived (especially potato plant-derived) components, 
which can be readily accomplished by expression of the relevant nucleotide sequence in 
a suitable non-plant host (such as any one of the yeast strains routinely used for expression 
purposes, e.g. Pichia spp. or Saccharomyces spp). Typically the enzyme will substantially 
comprise the sequence of amino acid residues 49 to 882 shown in Figure 5 (disregarding 
the sequence MNKRIDL, which is not part of the enzyme), or a functional equivalent 
thereof. The polypeptide of the invention may be used in a method of modifying starch 
in vitro, comprising treating starch under suitable conditions (e.g. appropriate temperature, 
pH, etc) with an effective amount of the polypeptide according to the invention. 

The term "functional equivalent", as applied herein to amino acid sequences, is intended 
to encompass amino acid sequences substantially similar to that shown in Figure 5 r such 
that the polypeptide possesses sufficient activity to complement the branching enzyme 
mutation in E. coli KV 832 cells (described below) and which is active in E. coli in the 
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at least one third of the full length sequence, but by simple trial and error other fragments 
(smaller or larger) may be found which are functional in altering the characteristics of the 
plant. 

Thus, in a further aspect the invention provides a method of altering the characteristics of 
a plant, comprising introducing into the plant an effective portion of the sequence of the 
invention operably linked to a suitable promoter active in the plant. Conveniently the 
sequence will be linked in the anti-sense orientation to the promoter. Preferably the plant 
is a potato plant. Conveniently, the characteristic altered relates to the starch content 
and/or starch composition of the plant (i.e. amount and/or type of starch present in the 
plant). Preferably the method of altering the characteristics of the plant will also comprise 
the introduction of one or more further sequences, in addition to an effective portion of 
the sequence of the invention. The introduced sequence of the invention and the one or 
more further sequences (which may be sense or antisense sequences) may be operably 
linked to a single promoter (which would ensure both sequences were transcribed at 
essentially the same time), or may be operably linked to separate promoters (which may 
be necessary for optimal expression). Where separate promoters are employed they may 
be identical to each other or different. Suitable promoters are well known to those skilled 
in the an and include both constitutive and inducible types. Examples include the CaMV 
35S promoter (e.g. single or tandem repeat) and the patatin promoter. Advantageously 
the promoter will be tissue-specific. Desirably the promoter will cause expression of the 
operably linked sequence at substantial levels only in the tissue of the plant where starch 
synthesis and/or starch storage mainly occurs. Thus, for example, where the sequence is 
introduced into a potato plant, the operably linked promoter may be tuber-specific, such 
as the patatin promoter. 

Desirably, for example, the method will also comprise the introduction of an effective 
portion of a sequence encoding a class B SBE, operably linked in the antisense orientation 
to a suitable promoter active in the plant. Desirably the further sequence will comprise 
an effective portion of the sequence encoding the potato class B SBE molecule. 
Conveniently the further sequence will comprise an effective portion of the sequence 
described by Blennow & Johansson (1991 Phytochem. 30, 437-444) or that disclosed in 
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chips (amy lose generally being used as a coating to prevent fat uptake), and to prepare 
mashed potato (especially "instant" mashed potato) having particular characteristics. 

In particular relation to potato plants, the invention provides a potato plant or part thereof 
which, in its wild type possesses an effective SBE A gene, but which plant has been 
altered such that there is no effective expression of an SBE A polypeptide within the cells 
of at least part of the plant. The plant may have been altered by the method defined 
above, or may have been selected by conventional breeding to be deleted for the class A 
SBE gene, presence or absence of which can be readily determined by screening samples 
of the plants with a nucleic acid probe or antibody specific for the potato class A gene or 
gene product respectively. 

The invention also provides starch extracted from a plant altered by the method defined 
above, or the progeny of such a plant, the starch having altered properties compared to 
starch extracted from equivalent, but unaltered, plants. The invention further provides a 
method of making altered starch, comprising altering a plant by the method defined above 
and extracting therefrom starch having altered properties compared to starch extracted 
from equivalent, but unaltered, plants. Use of nucleotide sequences in accordance with 
the invention has allowed the present inventors to produce potato starches having a wide 
variety of novel properties. 

In particular the invention provides the following: a plant (especially a potato plant) altered 
by the method defined above, containing starch which, when extracted from the plant, has 
an elevated endotherm peak temperature as judged by DSC, compared to starch extracted 
from a similar, but unaltered, plant; a plant (especially a potato plant) altered by the 
method defined above, containing starch which, when extracted from the plant, has an 
elevated viscosity onset temperature (conveniently elevated by 10 - 25°C) as judged by 
viscoamylograph compared to starch extracted from a similar, but unaltered, plant; a plant 
(especially a potato plant) altered by the method defined above, containing starch which, 
when extracted from the plant, has a decreased peak viscosity (conveniently decreased by 
240 - 700SNUs) as judged by viscoamylograph compared to starch extracted from a 
similar, but unaltered, plant; a plant (especially a potato plant) altered by the method 



BNSDOCID:<WO 9634968A2> 



WO 96/34968 




PCT/GB96/01075 



phase (step 5) and has a set-back viscosity of 303 SNUs or less as judged by 
viscoamylograph according to the protocol defined below. 



For the purposes of the present invention, viscoamylograph conditions are understood to 
pertain to analysis of a 10% (w/w) aqueous suspension of starch at atmospheric pressure, 
using a Newport Scientific Rapid Visco Analyser with a heating profile of: holding at 50°C 
for 2 minutes (step 1), heating from 50 to 95°C at a rate of 1.5°C per minute (step 2), 
holding at 95°C for 15 minutes (step 3), cooling from 95 to 50°C at a rate of 1.5°C per 
minute (step 4), and then holding at 50°C for 15 minutes (step 5). Peak viscosity may be 
defined for present purposes as the maximum viscosity attained during the heating phase 
(step 2) or the holding phase (step 3) of the viscoamylograph. Pasting viscosity may be 
defined as the viscosity attained by the starch suspensions at the end of the holding phase 
(step 3) of the viscoamylograph. Set-back viscosity may be defined as the viscosity of the 
starch suspension at the end of step 5 of the viscoamylograph. 

In yet another aspect the invention provides starch from a potato plant having an apparent 
amy lose content (% w/w) of at least 35%, as judged by iodometric assay according to the 
method described by Morrison & Laigneiet (1983 J. Cereal Science 7, 9-20). Preferably 
the starch will have an amy lose content of at least 40%, more preferably at least 50%, and 
most preferably at least 66%. Starch obtained directly from a potato plant and having 
such properties has not hitherto been produced. Indeed, as a result of the present 
invention, it is now possible to generate in vivo potato starch which has some properties 
analogous to the very high amy lose starches (e.g. Hylon 7) obtainable from maize. 

Starches with high (at least 35%) amylose contents find commercial application as, 
amongst other reasons, the amylose component of starch reassociates more strongly and 
rapidly than the amylopectin component during retrogradation processes. This may result, 
for example, in pastes with higher viscosities, gels of greater cohesion, or films of greater 
strength for starches with high (at least 35%) compared with normal (less than 35%) 
amylose contents. Alternatively, starches may be obtained with very high amylose 
contents, such that the granule structure is substantially preserved during heating, resulting 
in starch suspensions which demonstrate substantially no increase in viscosity during 
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Under the heading of "Industrial" applications, the novel starches of the invention may be 
advantageously employed, for example, in corrugating adhesives, in biodegradable 
products such as loose fill packaging and foamed shapes, and in the production of glass 
fibers and textiles. 

Those skilled in the an will appreciate that the novel starches of the invention may, if 
desired, be subjected in vitro to conventional enzymatic, physical and/or chemical 
modification, such as cross-linking, introduction of hydrophobic groups (e.g. octenyl 
succinic acid, dodecyl succinic acid), or derivatization (e.g. by means of esterification or 
etherification). 

In yet another aspect the invention provides high (35% or more) amylose starches which 
generate paste viscosities greater than those obtained from high amylose starches from 
maize plants after processing at temperatures below 100° C. This provides the advantage 
of more economical starch gelatinisation and pasting treatments through the use of lower 
processing temperatures than are currently required for high amylose starches from maize 
plants. . 

The invention will now be further described by way of illustrative example and with 
reference to the drawings, of which: 

Figure 1 shows a typical viscoamylograph for a 10% w/w suspension of potato starch; 

Figure 2 shows vsicoamylographs for 10% suspensions of starch from various maize 
varieties; 

Figure 3 is a schematic representation of the cloning strategy used by the present 
inventors; 

Figure 4a shows the amino acid alignment of the C-terminal portion of starch branching 
enzyme isoforms from various sources; amino acid residues matching the consensus 
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Examples 
Example 1 

Cloning of Potato class A SBE 

The strategy for cloning the second form of starch branching enzyme from potato is shown 
in Figure 3 . The small arrowheads represent primers used by the inventors in PCR and 
RACE protocols. The approximate size of the fragments isolated is indicated by the 
numerals on the right of the Figure. By way of explanation, a comparison of the amino 
acid sequences of several cloned plant starch branching enzymes (SBE) from maize (class 
A), pea (class A), maize (class B), rice (class B) and potato (class B), as well as human 
glycogen branching enzyme, allowed the inventors to identify a region in the 
carboxy-terminal one third of the protein which is almost completely conserved 
(GYLNFMGNEFGHPEWIDFPR) (Figure 4a). A multiple alignment of the DNA 
sequences (human, pea class A, potato class B, maize class B, maize class A and rice class 
B, respectively) corresponding to this region is shown in Figure 4b and was used to design 
an oligo which would potentially hybridize to all known plant starch branching enzymes: 
AATTT(C/T)ATGGGIAA(C/T)GA(A/G)TT(C/T)GG (Seq ID No. 20). 

Library PCR 

The initial isolation of a partial potato class A SBE cDNA clone was from an amplified 
potato tuber cDNA library in the XZap vector (Stratagene). One half /xL of a potato 
cDNA library (titre 2.3 x 10 9 pfti/mL) was used as template in a 50 reaction containing 
100 pmol of a 16 fold degenerate POTSBE primer and 25 pmol of a T7 primer (present 
in the XZap vector 3* to the cDNA sequences - see Figure 3), 100 jiM dNTPs, 2.5 U Taq 
polymerase and the buffer supplied with the Taq polymerase (Stratagene). All components 
except the enzyme were added to a 0.5 mL microcentrifuge tube, covered with mineral 
oil and incubated at 94°C for 7 minutes and then held at 55 °C, while the Taq polymerase 
was added and mixed by pipetting. PCR was then performed by incubating for 1 min at 
94°C, 1 min at 58°C and 3 minutes at 72°C. for 35 cycles. The PCR products were 
extracted with phenol/chloroform, ethanol precipitated and resuspended in TE pH 8.0 
before cloning into the T/A cloning vector pT7BlueR (Invitrogen). 
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primer. Excess primers were, removed on a Centricon 100 column and cDNA was 
recovered and precipitated with isopropanol. cDNA was A-tailed in a volume of 20 ix\ 
using 10 units terminal transferase (BRL), 200 /xM dATP for 10 min at 37 °C, followed 
by 5 min at 65 °C. The reaction was then diluted to 0.5 ml with TE pH 8 and stored at 
4°C as the cDNA pool. cDNA clones were isolated by PCR amplification using the 
primers R^dT^, ^ and POTSBE24. The PCR was performed in 50 fiL using a hot start 
technique: 10 /xL of the cDNA pool was heated to 94°C in water for 5 min with 25 pmol 
POTSBE24, 25 pmol Ro and 2.5 pmol of RoR,dT l7 and cooled to 75°C. Five fiL of 10 
x PCR buffer (Stratagene), 200 /xM dNTPs and 1.25 units of Taq polymerase were added, 
the mixture heated at 45 °C for 2 min and 72°C for 40 min followed by 35 cycles of 94°C 
for 45 sec, 50°C for 25 sec, 72°C for 1.5 min and a final incubation at 72°C for 10 min. 
PCR products were separated by electrophoresis on 1 % low melting agarose gels and the 
smear covering the range 600-800 bp fragments was excised and used in a second PCR 
amplification with 25 pmol of Rj and POTSBE25 primers in a 50 fih reaction (28 cycles 
of 94°C for 1 min, 50°C 1 min, 72°C 2 min). Products were purified by chloroform 
extraction and cloned into pT7 Blue. PCR was used to screen the colonies and the longest 
clones were sequenced; 

The first round of RACE only extended the length of the SBE sequence approximately 100 
bases, therefore a new A-tailed cDNA library was constructed using the class A SBE 
specific oligoPOTSBE24 (10 pmol) in an attempt to recover longer RACE products. The 
first and second round PCR reactions were performed using new class A SBE primers 
(POTSBE 28 and 29 respectively) derived from the new sequence data. Conditions were 
as before except that the elongation step in the first PCR was for 3 min and the second 
PCR consisted of 28 cycles at 94 °C for 45 seconds, 55 °C for 25 sec and 72 °C for 1 
min 45 sec. 

Clones ranging in size from 400 bp to 1.4 kb were isolated and sequenced. The combined 
sequence of the longest RACE products and cDNA clones predicted a full length gene of 
about 3150 nucleotides, excluding the poly (A) tail (psbe 2con.seq in Fig. 8). 

As the sequence of the 5' half of the gene was compiled from the sequence of several 
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degree of similarity, identical residues being denoted by the common letter, conservative 
changes by two dots and neutral changes by a single dot. Dashes indicate gaps introduced 
to optimise the alignment. The class A SBE protein has 44% identity over the entire 
length with potato class B SBE, and 56% identity therewith in the central conserved 
domain (Figure 6), as judged by the "Megalign" program (DNASTAR). However. Figure 
7 shows a comparison between potato class A SBE (top row, residues 1-873) and pea class 
A SBE (bottom row, residues 1-861), from which it can be observed that cloned potato 
gene is more homologous to the class A pea enzyme, where the identity is 70 % over 
nearly the entire length, and this increases to 83 % over the central conserved region 
(starting at IPPP at position "170). It is clear from this analysis that this cloned potato 
SBE gene belongs to the class A family of SBE genes. 

An E. coli culture, containing the plasmid pSJ78 (which directs the expression of a full 
length potato SBE Class A gene), has been deposited (on 3rd January 1996) under the 
terms of the Budapest Treaty at The National Collections of Industrial and Marine Bacteria 
Limited (23 St Machar Drive, Aberdeen, AB2 1RY, United Kingdom), under accession 
number NCIMB 40781. Plasmid pSJ78 is equivalent to clone 19 described above. It 
represents a full length SBE A cDNA blunt-end ligated into the vector pBSSKHP. 

Polymorphism of class A SBE genes 

Sequence analysis of the other two full length class A SBE genes showed that they contain 
frameshift mutations and are therefore unable to encode full length proteins and indeed 
they were unable to complement the branching enzyme deficiency in the KV832 mutant 
(described below). An alignment of the full length DNA sequences is shown in Figure 
8: "lOcon.seq" (Seq ID No. 12), "19con.seq" (Seq ID No. 14) and "llcon.seq" (Seq ID 
No. 13) represent the sequence of full length clones 10, 19 and 11 obtained by PCR using 
the PBER1 and PBERT primers (see below), whilst "psbe2con.seq M (Seq ID No. 18) 
represents the consensus sequence of the RACE clones and cDNA clone 3.2.1. Those 
nucleotides which differ from the overall consensus sequence (not shown) are shaded. 
Dashes indicate gaps introduced to optimise the alignment. Apart from the frameshift 
mutations these clones are highly homologous. It should be noted that the 5' sequence of 
psbe2con is longer because this is the longest RACE product and it also contains several 
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A comparison of all the full length sequences is shown in Figure 10. In addition to clones 
10, 11 and 19 are shown the sequences of a Bgl II - Xho I product cloned directly into the 
QE32 expression vector C86CON.SEQ\ Seq ID No. 16) and the consensus sequence of 
the directly sequenced PCR products ("pcrsbe2con.seq", Seq ID No. 17). Those 
nucleotides which differ from the consensus sequence (not shown) are shaded. Dashes 
indicate gaps introduced to optimise the alignment. There are 1 1 nucleotide differences 
predicted to be present in the mRNA population, which are indicated by asterisks above 
and below the sequence. The other differences are probably PCR artefacts or possibly 
sequencing errors. 

Complementation of a branching enzyme deficient E. coli mutant 
To determine if the isolated SBE gene encodes an active protein i.e. one that has 
branching enzyme activity, a complementation test was performed in the E. coli strain 
KV832. This strain is unable to make bacterial glycogen as the gene for the glycogen 
branching enzyme has been deleted (Keil et aL, 1987 Mol. Gen. Genet. 207, 294-301). 
When wild type cells are grown in the presence of glucose they synthesise glycogen (a 
highly branched glucose polymer) which stains a brown colour with iodine, whereas the 
KV832 cells make only a linear chain glucose polymer which stains blueish green with 
iodine. To determine if the cloned SBE gene could restore the ability of the KV832 cells 
to make a branched polymer, the clone pSJ90 (Seq ID No. 19) was used and constructed 
as below. The construct is a PCR-derived, substantially full length fragment (made using 
primers PBE 2B and PBE 2X ; detailed below), which was cut with Bgl II and Xho I and 
cloned into the BamH I / Sal I sites of the His-tag expression vector pQE32 (Qiagen). 
This clone, pSJ86, was sequenced and found to have a frameshift mutation of two bases 
in the 5' half of the gene. This frameshift was removed by digestion with Nsi I and SnaB 
I and replaced with the corresponding fragment from a Taq-generated PCR clone to 
produce the plasmid pSJ90 (sequence shown in Figure 12; the first 10 amino acids are 
derived from the expression vector). The polypeptide encoded by pSJ90 would be 
predicted to correspond to amino acids 46-882 of the full SBE coding sequence. The 
construct pSJ90 was transformed into the branching enzyme deficient KV832 cells and 
transformants were grown on solid PYG medium (0.85% KH 2 P0 4 , 1.1% K 2 HP0 4 , 0.6% 
yeast extract) containing 1.0% glucose. To test for complementation, a loop of cells was 
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phosphorylation stimulation assay and that the cloned gene does indeed code for a protein 
with branching enzyme activity. 

Oligonucleotides 

The following synthetic oligonucleotides (Seq ID No.s 1-11 respectively) were used: 



RoR,dT l7 


AAGGATCCGTCGACATCGATAATACGACTCACTATAGGGA(T) 




AAGGATCCGTCGACATC 


R« 


GACATCGATAATACGAC 


POTSBE24 


CATCCAACCACCATCTCGCA 


POTSBE25 


TTGAGAGAAGATACCTAAGT 


POTSBE28 


ATGTTCAGTCCATCTAAAGT 


POTSBE29 


AGAACAACAATTCCTAGCTC 


PBER 1 


GGGGCCTTGAACTCAGCAAT 


PBERT 


CGTCCCAGCATTCGACATAA 


PBE 2B 


CTTGGATCCTTGAACTCAGCAATTTG 


PBE 2X 


TAACTCGAGCAACGCGATCACAAGTTCGT 



Example 2 

Production of Transgenic Plants 

Construction of plant transformation vectors with antisense starch branching enzyme 
genes 

A 1200 bp Sac I - Xho I fragment, encoding approximately the -COOH half of the potato 
class A SBE (isolated from the rescued XZap clone 3.2.1), was cloned into the Sac I - Sal 
I sites of the plant transformation vector pSJ29 to create plasmid pSJ64, which is 
illustrated schematically in Figure 11. In the figure, the black line represents the DNA 
sequence. The broken line represents the bacterial plasmid backbone (containing the 
origin of replication and bacterial selection marker), which is not shown in full. The filled 
triangles on the line denote the T-DNA borders (RB = right border, LB = left border). 
Relevant restriction sites are shown above the black line, with the approximate distances 
(in kilobases) between the sites (marked by an asterisk) given by the numerals below the 
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Transformation was confirmed in all cases by production of a DNA fragment from the 
antisense gene after PCR in the presence of appropriate primers and a crude extract of 
genomic DNA from each regenerated shoot. 

Characterisation of starch from potato plants 

Starch was extracted from plants as follows: potato tubers were homogenised in water for 
2 minutes in a Waring blender operating at high speed. The homogenate was washed and 
filtered (initially through 2mm, then through 1mm filters) using about 4 litres of water per 
lOOgms of tubers (6 extractions). Washed starch granules were Finally extracted with 
acetone and air dried. 

Starch extracted from singly transformed potato plants (class A/SBE II antisense, or class 
B/SBE I antisense), or from double transformants (class A/SBE II and class B/SBE I 
antisense), or from untransformed control plants, was partially characterised. The results 
are shown in Table 1 . The table shows the amount of SBE activity (units/gram tissue) in 
tubers from each transformed plant. The endotherm peak temperature (°C) of starch 
extracted from several plants was determined by DSC, and the onset temperature (°C) of 
pasting was determined by reference to a viscoamylograph ("RVA"), as described in WO 
95/26407. The viscoamylograph profile was as follows: step 1 - 50°C for 2 minutes; step 
2 - increase in temperature from 50°C to 95°C at a rate of 1.5°C per minute; step 3 - 
holding at 95°C for 15 minutes; step 4 - cooling from 95°C to 50°C at a rate of 1.5°C per 
minute; and finally, step 5 - holding at 50°C for 15 minutes. Table 1 shows the peak, 
pasting and set-back viscosities in stirring number units (SNUs), which is a measure of 
the amount of torque required to stir the suspensions. Peak viscosity may be defined for 
present purposes as the maximun viscosity attained during the heating phase (step 2) or 
the holding phase (step 3) of the viscoamylograph. Pasting viscosity may be defined as 
the viscosity attained by the starch suspensions at the end of the holding phase (step 3) of 
the viscoamylograph. Set-back viscosity may be defined as the viscosity of the starch 
suspension at the end of step 5 of the viscoamylograph. 

A determination of apparent amy lose content (% w/w) was also performed, using the 
iodometric assay method of Morrison & Laignelet (1983 J. Cereal Sci. i, 9-20). The 
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149 (35.6% amylose); shaded triangle, pointing upwards - plant 152 (37.5%); shaded 
triangle, pointing downwards - plant 161 (40.9%); shaded diamond - plant 150 (53.1%); 
unshaded box - plant 236a (56.7%); unshaded circle - plant 236 (60.4%); unshaded 
triangle, pointing upwards - plant 201 (66.4%); unshaded triangle, pointing downwards - 
Hylon V starch, from maize (44.9 % amylose). The thin line denotes the heating profile. 

With increasing amylose content, peak viscosities during processing to 95°C decrease, and 
the drop in viscosity from the peak until the end of the holding period at 95°C also 
generally decreases (indeed, for some of the starch samples there is an increase in 
viscosity during this period). Both of these results are indicative of reduced granule 
fragmentation, and hence increased granule stability during pasting. This property has not 
previously been available in potato starch without extensive prior chemical or physical 
modification. For applications where a maximal viscosity after processing to 95 °C is 
desirable (i.e. corresponding to the viscosity after 47 minutes in the viscoamylograph test), 
starch from plant #152 would be selected as starches with both lower (Controls, #149) and 
higher (#161, #150) amylose contents have lower viscosities following this gelatinisation 
and pasting regime (Figure 13 and Table 1). It is believed that the viscosity at this stage 
is determined by a combination of the extent of granule swelling and the resistance of 
swollen granules to mechanical fragmentation. For any desired viscosity behaviour, one 
skilled in the an would select a potato starch from a range containing different amylose 
contents produced according to the invention by performing suitable standard viscosity 
tests. 

Upon cooling pastes from 95 °C to 50°C, potato starches from most plants transformed 
in accordance with the invention showed an increase in viscoamylograph viscosity as 
expected for partial reassociation of amylose. Starches from plants #149, 152 and 161 all 
show viscosities at 50°C significantly in excess of those for starches from control plants 
(Figure 13 and Table 1). This contrasts with the effect of elevated amylose contents in 
starches from maize plants (Figure 2) which show very low viscosities throughout the 
viscoamylograph test. Of particular note is the fact that, for similar amylose contents, 
starch from potato plant 150 (53% amylose) shows markedly increased viscosity compared 
with Hylon 5 starch (44.9% amylose) as illustrated in Figure 13. This demonstrates that 
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201, 208 and 208a did not display a clearly defined pasting viscosity peak during the 
heating phase. Microscopic analysis confirmed that the starch granule structure underwent 
only minor swelling during the experimental heating process. This property may well be 
particularly useful in certain applications, as will be apparent to those skilled in the an. 

Some re-grown plants have so far been found to increase still further the apparent amylose 
content of starch extracted therefrom. Such increases may be due to:- 

i) Growth and development of the first generation transformed plants may have been 
affected to some degree by the exogenous growth hormones present in the tissue culture 
system, which exogenoous hormones were not present during growth of the second 
generation plants; and 

ii) Subsequent generations were grown under field conditions, which may allow for 
attainment of greater maturity than growth under laboratory conditions, it being generally 
held that amylose content of potato starch increases with maturity of the potato tuber. 
Accordingly, it should be possible to obtain potato plants giving rise to tubers with starch 
having an amylose content in excess of the 66% level so far attained, simply by analysing 
a greater number of transformed plants and/or by re-growing transgenic plants through one 
or more generations under field conditions. 

Table 1 shows that another characteristic of starch which is affected by the presence of 
anti-sense sequences to SBE is the phosphorus content. Starch from untransformed control 
plants had a phosphorus content of about 60-70mg/100sram dry weight (as determined 
according to the AOAC Official Methods of Analysis, 15th Edition, Method 948.09 
"Phosphorus in Flour"). Introduction into the plant of an anti-sense SBE B sequence was 
found to cause a modest increase (about two-fold) in phosphorus content, which is in 
agreement with the previous findings reported at scientific meetings. Similarly, anti-sense 
to SBE A alone causes only a small rise in phosphorus content relative to untransformed 
controls. However, use of anti-sense to both SBE A and B in combination results in up 
to a four-fold increase in phosphorus content, which is far greater than any in planta 
phosphorus content previously demonstrated for potato starch. 

This is useful in that, for certain applications, starch must be phosphorylated in vitro by 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: National Starch and Chemical Investment 

Holding Corporation 

(B) STREET: 501 Silverside Road. Suite 27 

(C) CITY: Wilmington 

(D) STATE: Delaware 

(E) COUNTRY: United States of America 

(F) POSTAL CODE (ZIP): 19809 

(ii) TITLE OF INVENTION: Improvements in or Relating to Plant Starch 
Composition 

(iii) NUMBER OF SEQUENCES: 20 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #1.0. Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
AAGGATCCGT CGACATCGAT AATACGACTC ACTATAGGGA I M I I 1 1 I I I 1 1 I I I 11 57 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AAGGATCCGT CGACATC 17 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: .17 base pairs 
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(xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 7: 



AGAACAACAA TTCCTAGCTC 



20 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGGCCTTGA ACTCAGCAAT 20 



(2) INFORMATION FOR SEQ ID NO: 9: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGTCCCAGCA TTCGACATAA 20 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTTGGATCCT TGAACTCAGC AATTTG 26 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 
TAACTCGAGC AACGCGATCA CAAGTTCGT 29 
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GAACGCCCGA 


CGACCTTAAG 


TCTTTGATTG 


ATAAAGCTCA 


TGAGCTAGGA 


ATTGTTGTTC 


1440 


TCATGGACAT 


TGTTCACAGC 


CATGCATCAA 


ATAATACTTT 


AGATGGACTG 


AACATGTTTG 


1500 


ACGGCACAGA 


TAGTTGTTAC 


TTTCACTCTG 


GAGCTCGTGG 


TTATCATTGG 


ATGTGGGATT 


1560 


TCCGCCTCTT 


TAACTATGGA 


AACTGGGAGG 


TACTTAGGTA 


TCTTCTCTCA 


AATGCGAGAT 


1620 


GGTGGTTGGA 


TGAGTTCAAA 


TTTGATGGAT 


TTAGATTTGA 


TGGTGTGACA 


TCAATGATGT 


1680 


GTACTCACCA 


CGGATTATCG 


GTGGGATTCA 


CTGGGAACTA 


CGAGGAATAC 


TTTGGACTCG 


1740 


CAACTGATGT 


GGATGCTGTT 


GTGTATCTGA 


TGCTGGTCAA 


CGATCTTATT 


CATGGGCTTT 


1800 


TCCCAGATGC 


AATTACCATT 


GGTGAAGATG 


TTAGCGGAAT 


GCCGACATTT 


TGTGTTCCCG 


1860 


TTCAAGATGG 


GGGTGTTGGC 


TTTGACTATC 


GGCTGCATAT 


GGCAATTGCT 


GATAAATGGA 


1920 


TTGAGTTGCT 


CAAGAAACGG 


GATGAGGATT 


GGAGAGTGGG 


TGATATTGTT 


CATACACTGA 


1980 


CAAATAGAAG 


ATGGTCGGAA 


AAGTGTGTTT 


CATACGCTGA 


MGTCATGAT 


CAAGCTCTAG 


2040 


TCGGTGATAA 


AACTATAGCA 


TTCTGGCTGA 


TGGACAAGGA 


TATGTATGAT 


TTTATGGCTC 


2100 


TGGATAGACC 


GTCAACATCA 


TTAATAGATC 


GTGGGATAGC 


ATTACACAAG 


ATGATTAGGC 


2160 


TTGTAACTAT 


GGGATTAGGA 


GGAGAAGGGT 


ACCTAAATTT 


CATGGGAAAT 


GAATTCGGCC 


2220 


ACCCTGAGTG 


GATTGATTTC 


CCTAGGGCTG 


AACAACACCT 


CTCTGATGGC 


TCAGTAATTC 


2280 


CCAGAAACCA 


ATTCAGTTAT 


GATAAATGCA 


GACGGAGATT 


TGACCTGGGA 


GATGCAGAAT 


2340 


ATTTAAGATA 


CCGTGGGTTG 


CAAGAATTTG 


ACCGGGCTAT 


GCAGTATCTT 


GAAGATAAAT 


2400 


ATGAGTTTAT 


GACTTCAGAA 


CACCAGTTCA 


TATCACGAAA 


GGATGAAGGA 


GATAGGATGA 


2460 


TTGTATTTGA 


AAAAGGAAAC 


CTAGI 1 1 MG 


TCTTTAATTT 


TCACTGGACA 


AAAGGCTATT 


2520 


CAGACTATCG 


CATAGGCTGC 


CTGAAGCCTG 


GAAAATACM 


GGTTGCCTTG 


GACTCAGATG 


2580 


ATCCACI 1 1 1 


TGGTGGCTTC 


GGGAGAATTG 


ATCATAATGC 


CGAATATTTC 


ACCI 1 IGMG 


2640 


GATGGTATGA 


TGATCGTCCT 


CGTTCAATTA 


TGGTGTATGC 


ACCTAGTAGA 


ACAGCAGTGG 


2700 


TCTATGCACT 


AGTAGACAAA 


GAAGAAGAA.G 


AAGAAGAAGA 


AGTAGCAGTA 


GTAGAAGAAG 


2760 


TAPTAPTAP A 

1 Ab I Ab 1 AbA 


AP A AP A ATP A 

AbAAbAA 1 bA 


APPA APTTP - 

ALbAAL 1 1 b i 


r ATrrrrTTP 


A A AP ATTTP A 

AAAbAI 1 IbA 


APPPP APATA 

ALbLuALA 1 A 


ZoZv 


GAGCTTCTTG 


ACGTATCTGG 


CAATATTGCA 


TTAGTCTTGG 


CGGAATTTCA 


TGTGACAACA 


2880 


GGTTTGCAAT 


TCTTTCCACT 


ATTAGTAGTG 


CAACGATATA 


CGCAGAGATG 


AAGTGCTGAA 


2940 


CAAAAACATA 


TGTAAAATCG 


ATGAATTTAT 


GTCGAATGCT 


GGGACGATCG 


AATTCCTGCA 


3000 



GCC 3003 
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GAACGCCCGA 


CGACCTTAAG 


TCTTCGATTG 


TCATGGACAT 


CGTTCACAGC 


CATGCATCAA 


ACGGCACCGA 


TAGTTGTTAC 


TTTCACTCTG 


CCGCCTC 1 1 1 


AACTATGGAA 


ACTGGGAGGT 


GTGGTTGGAT 


GAGTTCAAAT 


TTGATGGATT 


TACTCACCAC 


GGATTATCGG 


TGGGATTCAC 


AACTGATGTG 


GATGCTGTTG 


TGTATCTGAT 


CCCAGATGCA 


ATTACCATTG 


GTGAAGATGT 


TCAAGATGGG 


GGTGTTGGCT 


TTGACTATCG 


TGAGTTGCTC 


AAGAAACGGG 


ATGAGGATTG 


AAATAGAAGA 


TGGTCGGAAA 


AGTGTGTTTC 


CGGTGATAAA 


ACTATAGCAT 


TCTGGCTGAT 


GGATAGACCG 


CCAACATCAT 


TAATAGATCG 


TGTAACTATG 


GGATTAGGAG 


GAGAAGGGTA 


CCCTGAGTGG 


ATTGAI 1 ICC 


CTAGGGCTGA 


CGGAAACCAA 


TTCAGTTATG 


ATAAATGCAG 


TTTAAGATAC 


CATGGGTTAC 


AAGAATTTGA 


TGAGTTTATG 


ACTTCAGAAC 


ACCAGTTCAT 


TGTATTTGAA 


AGAGGAAACC 


TAGI 1 1 I'CGT 


AGACTATCGC 


ATAGGCTGCC 


TGAAGCCTGG 


TCCAC 1 1 1 1 1 


GGTGGCTTCG 


GGAGAATTGA 


ATCGTATGAT 


GATCGTCCTT 


GTTCAATTAT 


CTATGCACTA 


GTAGACAAAC 


TAGAAGTAGC 


GAACTTGTGA 


TCGCGTTGAA 


AGAl 1 iGAAC 


ACATCAGTCT 


TGGCGGAATT 


GCATGTGACA 


AGTCCACCGA 


TATACGCAGA 


GATGAAGTGC 


ATGTCGAATG 


CTGGGACGAT 


CGAATTCCTG 
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ATAAAGCTCA TGAGCTAGGA ATTGTTGTTC 1440 

ATAATACTTT AGATGGACTG AACATGTTTG 1500 

GAGCTCGTGG TTATCATTGG ATGTGGGATT 1560 

ACTTAGGTAT CTTCTCTCAA ATGCGAGATG 1620 

TAGATTCGAT GGTGTGACAT CAATGATGTA 1680 

TGGGAACTAC GAGGAATACT TTGGACTCGC 1740 

GCTGGTCAAC GATCTTATTC ATAGGCTTTT 1800 

TAGCGGAATG CCGACATTTT GTATTCCCGT 1860 

GCTGCATATG GCAATTGCTG ATAAATGGAT 1920 

GAGAGTGGGT GATATTGTTC ATACACTGAC 1980 

ATACGCTGAA AGTCATGATC AAGCTCTAGT 2040 

GGACAAGGAT ATGTATGATT TTATGGCTCT 2100 

TGGGATAGCA TTGCACAAGA TGATTAGGCT 2160 

CCTAAATTTC ATGGGAAATG AATTCGGCCA 2220 

GCCACACCTT TCTGATGGCT CAGTAATTCC 2280 

ACGGAGATTT GACCTGGGAG ATGCAGAATA 2340 

CTGGGCTATG CAGTATCTTG AAGATAAATA 2400 

ATCACGAAAG GATGAAGGAG ATAGGATGAT 2460 

CTTTAATTTT CACTGGACAA ATAGCTATTC 2520 

AAAATACAAG GTTGTCTTGG ACTCAGATGA 2580 

TCATAATGCC GAATATTTCA CCTCTGAAGG 2640 

GGTGTATGCA CCTAGTAGAA CAGCAGTGGT 2700 

AGTAGTAGAA GAACCCATTG AAGAATGAAC 2760 

GTTACTTGGT CATCCACATA GAGCTTCTTG 2820 

ACAAGGTTTG CAGTTCTTTC CACTATTAGT 2880 

TGAACAAACA TATGTAAAAT CGATGAATTT 2940 

CAGCC 2975 
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CTA CM CTA CM GM GGT GGT MA CTG GAG GAG TCT AM ACA TTA MT 603 

Leu Gin Leu Gin Glu Gly Gly Lys Leu Glu Glu Ser Lys Thr Leu Asn 
140 145 150 

ACT TCT GM GAG ACA ATT ATT GAT GM TCT GAT AGG ATC AGA GAG AGG 651 

Thr Ser Glu Glu Thr He lie Asp Glu Ser Asp Arg He Arg Glu Arg 
155 160 165 

GGC ATC CCT CCA CCT GGA CTT GGT CAG MG ATT TAT GM ATA GAC CCC 699 

Gly He Pro Pro Pro Gly Leu Gly Gin Lys He Tyr Glu He Asp Pro 
170 175 180 185 



CTT TTG ACA MC TAT CGT CM CAC CTT GAT TAC AGG TAT TCA CAG TAC 747 
Leu Leu Thr Asn Tyr Arg Gin His Leu Asp Tyr Arg Tyr Ser Gin Tyr 
190 195 200 

MG MA CTG AGG GAG GCA ATT GAC MG TAT GAG GGT GGT TTG GM GCC 795 
Lys Lys Leu Arg Glu Ala He Asp Lys Tyr Glu Gly Gly Leu Glu Ala 
205 210 215 

TTT TCT CGT GGT TAT GM MA ATG GGT TTC ACT CGT AGT GCT ACA GGT 843 
Phe Ser Arg Gly Tyr Glu Lys Met Gly Phe Thr Arg Ser Ala Thr Gly 
220 225 230 

ATC ACT TAC CGT GAG TGG GCT CTT GGT GCC CAG TCA GCT GCC CTC ATT 891 
He Thr Tyr Arg Glu Trp Ala Leu Gly Ala Gin Ser Ala Ala Leu He 
235 240 245 

GGA GAT TTC MC MT TGG GAC GCA MT GCT GAC ATT ATG ACT CGG MT 939 
Gly Asp Phe Asn Asn Trp Asp Ala Asn Ala Asp He Met Thr Arg Asn 
250 255 260 265 

GM TTT GGT GTC TGG GAG ATT TTT CTG CCA MT MT GTG GAT GGT TCT 987 
Glu Phe Gly Val Trp Glu lie Phe Leu Pro Asn Asn Val Asp Gly Ser 
270 275 280 

CCT GCA ATT CCT CAT GGG TCC AGA GTG MG ATA CGT ATG GAC ACT CCA 1035 
Pro Ala He Pro His Gly Ser Arg Val Lys He Arg Met Asp Thr Pro 
285 290 295 

TCA GGT GTT MG GAT TCC ATT CCT GCT TGG ATC MC TAC TCT TTA CAG 1083 
Ser Gly Val Lys Asp Ser He Pro Ala Trp He Asn Tyr Ser Leu Gin 
300 305 310 

CTT CCT GAT GM ATT CCA TAT MT GGA ATA CAT TAT GAT CCA CCC GM- 1131 
Leu Pro Asp Glu He Pro Tyr Asn Gly He His Tyr Asp Pro Pro Glu 
315 320 325 

GAG GAG AGG TAT ATC TTC CAA CAC CCA CGG CCA MG MA CCA MG TCG 1179 
Glu Glu Arg Tvr He Phe Gin His Pro Arg Pro Lys Lys Pro Lys Ser 
330 335 340 345 

CTG AGA ATA TAT GM TCT CAT ATT GGA ATG AGT AGT CCG GAG CCT AM 1227 
Leu Arg He Tvr Glu Ser His He Glv Met Ser Ser Pro Glu Pro Lys 
350 355 360 
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ATG GCA ATT GCT GAT AAA CGG'ATT GAG TTG CTC AAG AAA CGG GAT GAG 1947 

Met Ala He Ala Asp Lys Arg lie Glu Leu Leu Lys Lys Ara Asp Glu 
590 595 " 600 

GAT TGG AGA GTG GGT GAT ATT GTT CAT ACA CTG ACA AAT AGA AGA TGG 1995 
Asd Trp Arg Val Gly Asp He Val His Thr Leu Thr Asn Arg Arg Trp 
605 610 615 

TCG GAA AAG TGT GTT TCA TAC GCT GAA AGT CAT GAT CAA GCT CTA GTC 2043 
Ser Glu Lys Cys Val Ser Tyr Ala Glu Ser His Asp Gin Ala Leu Val 
620 625 630 

GGT GAT AAA ACT ATA GCA TTC TGG CTG ATG GAC AAG GAT ATG TAT GAT 2091 
Gly Asp Lys Thr He Ala Phe Trp Leu Met Asp Lys Asp Met Tyr Asp 
635 640 645 

TTT ATG GCT CTG GAT AGA CCG TCA ACA TCA TTA ATA GAT CGT GGG ATA 2139 
Phe Met Ala Leu Asp Arg Pro Ser Thr Ser Leu He Asp Arg Gly He 
650 655 660 ' 665 

GCA TTG CAC AAG ATG ATT AGG CTT GTA ACT ATG GGA TTA GGA GGA GAA 2187 
Ala Leu His Lys Met He Arg Leu Val Thr Met Gly Leu Gly Gly Glu 
670 675 680 

GGG TAC CTA AAT TTC ATG GGA AAT GAA TTC GGC CAC CCT GAG TGG ATT 2235 
Gly Tyr Leu Asn Phe Met Gly Asn Glu Phe Gly His Pro Glu Trp He 
685 690 695 

GAT TTC CCT AGG GCT GAA CAA CAC CTC TCT GAT GGC TCA GTA ATC CCC 2283 
Asp Phe Pro Arg Ala Glu Gin His Leu Ser Asp Gly Ser Val He Pro 
700 705 710 

GGA AAC CAA TTC AGT TAT GAT AAA TGC AGA CGG AGA TTT GAC CTG GGA 2331 
Gly Asn Gin Phe Ser Tyr Asp Lys Cys Arg Arg Arg Phe Asp Leu Gly 
715 720 725 

GAT GCA GAA TAT TTA AGA TAC CGT GGG TTG CAA GAA TTT GAC CGG CCT 2379 
Asp Ala Glu Tyr Leu Arg Tyr Arg Gly Leu Gin Glu Phe Asp Arg Pro 
730 735 740 745 

ATG CAG TAT CTT GAA GAT AAA TAT GAG TTT ATG ACT TCA GAA CAC CAG 2427 
Met Gin Tyr Leu Glu Asp Lys Tyr Glu Phe Met Thr Ser Glu His Gin 
750 755 760 

TTC ATA TCA CGA AAG GAT GAA GGA GAT AGG ATG ATT GTA TTT GAA AAA 2475 
Phe He Ser Arg Lys Asp Glu Gly Asd Arg Met He Val Phe Glu Lys 
765 770 775 

GGA AAC CTA GTT TTT GTC TTT AAT TTT CAC TGG ACA AAA AGC TAT TCA 2523 
Gly Asn Leu Val Phe Val Phe Asn Phe His Trp ihr Lys Ser Tyr Ser 
780 785 790 

GAC TAT CGC ATA GCC TGC CTG AAG CCT GGA AAA TAC AAG GTT GCC TTG 2571 
Asp Tvr Arg lie Ala Cys Leu Lys Pro Gly Lys ivr Lys Val Ala Leu 
795 800 805 
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Ala Ser Thr Asp Val Asp Ser Ser Thr Met Glu His Ala Ser Gin He 
100 105 110 

Lys Thr Glu Asn Asp Asp Val Glu Pro Ser Ser Asp Leu Thr Gly Ser 
115 120 125 

Val Glu Glu Leu Asd Phe Ala Ser Ser Leu Gin Leu Gin Glu Gly Gly 
130 135 140 

Lys Leu Glu Glu Ser Lys Thr Leu Asn Thr Ser Glu Glu Thr He Tie 
145 . • 150 155 160 

Asp Glu Ser Asp Arg lie Arg Glu Arg Gly He Pro Pro Pro Gly Leu 
165 170 175 

Gly Gin Lys He Tyr Glu He Asp Pro Leu Leu Thr Asn Tyr Arg Gin 
180 185 . 190 

His Leu Asp Tyr Arg Tyr Ser Gin Tyr Lys Lys Leu Arg Glu Ala He 
195 200 205 

Asp Lys Tyr Glu Gly Gly Leu Glu Ala Phe Ser Arg Gly Tyr Glu Lys 
210 215 220 

Met Gly Phe Thr Arg Ser Ala Thr Gly He Thr Tyr Arg Glu Trp Ala 
225 230 235 240 

Leu Gly Ala Gin Ser Ala Ala Leu He Gly Asp Phe Asn Asn Trp Asp 
245 250 255 

Ala Asn Ala Asp He Met Thr Arg Asn Glu Phe Gly Val Trp Glu He 
260 265 270 

Phe Leu Pro Asn Asn Val Asp Gly Ser Pro Ala He Pro His Gly Ser 
275 280 ' 285 

Arg Val Lys He Arg Met Asp Thr Pro Ser Gly Val Lys Asp Ser He 
290 295 300 

Pro Ala Trp He Asn Tyr Ser Leu Gin Leu Pro Asp Glu He Pro Tyr 
305 310 315 320 

Asn Gly He His Tyr Asp Pro Pro Glu Glu Glu Arg Tyr He Phe Gin 
325 330 335 

His Pro Arg Pro Lys Lys Pro Lys Ser Leu Arg He Tyr Glu Ser His 
340 ■ 345 350 

He Gly Met Ser Ser Pro Glu Pro Lys He Asn Ser Tyr Val Asn Phe 
355 360 365 

Arg Asp Glu Val Leu Pro Arg He Lys Lys Leu Gly Tyr Asn Ala Leu 
370 375 380 

Gin He Met Ala He Gin Glu His Ser Tyr Tyr Ala Ser Phe Gly Tyr 
385 . 390 395 400 
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His Leu Ser Asp Gly Ser Val He Pro Gly Asn Gin Phe Ser Tyr Asp 
705 710 . 715 720 

Lys Cys Arg Arg Arg Phe Asp Leu Gly Asd Ala Glu Tyr Leu Arg Tyr 
725 730 735 

.Arg Gly Leu Gin Glu Phe Asp Arg Pro Met Gin Tyr Leu Glu. Asp Lys 



Gly Asp Arg Met He Val Phe Glu Lys Gly Asn Leu Val Phe Val Phe 
770 775 780 

Asn Phe His Trp Thr Lys Ser Tyr Ser Asp Tyr Arg He Ala Cys Leu 
785 790 795 800 

Lys Pro Gly Lys Tyr Lys Val Ala Leu Asp Ser Asp Asp Pro Leu Phe 
805 810 815 

Gly Gly Phe Gly Arg He Asp His Asn Ala Glu Tyr Phe Thr Phe Glu 
820 825 830 

Gly Trp Tyr Asp Asp Arg Pro Arg Ser He Met Val Tyr Ala Pro Cys 
835 840 845 

Lys Thr Ala Val Val Tyr Ala Leu Val Asp Lys Glu Glu Glu Glu Glu 
850 855 860 

Glu Glu Glu Glu Glu Glu Val Ala Ala Val Glu Glu Val Val Val Glu 
865 870 875 880 

Glu Glu 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TCATTAAAGA GGAGAAATTA ACTATGAGAG GATCTCACCA TCACCATCAC CATGGGATCT 
TGGCTGAAAA GTCTTCTTAC AATTCCGAAT TCCGACCTTC TACAGTTGCA GCATCGGGGA 
AAGTCCTTGT GCCTGGAACC CAGAGTGATA GCTCCTCATC CTCAACAAAC CAATTTGAGT 
TCACTGAGAC ATCTCCAGAA AATTCCCCAG CATCAACTGA TGTAGATAGT TCAACAATGG 
AACACGCTAG CC AG ATT AAA ACTGAGAACG ATGACGTTGA GCCGTCAAGT GATCTTACAG 



60 
120 
180 
240 
300 



740 




750 



Tyr Glu Phe MeVTPir Ser Glu His Gin 'Phe He Ser Arg Lys" Asp" Glu 
755 760 765 
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GGAAATGAAT TCGGCCACCC TGAGTGGATT GATTTCCCTA GGGCTGAACA ACACCTCTCT 2040 

GATGACTCAG TAATTCCCGG AAACCAATTC AGTTATGATA AATGCAGACG GAGATTTGAC 2100 

CTGGGAGATG CAGAATATTT AAGATACCGT 6GGTTGCAAG AATTTGACCG GGCTATGCAG 2160 

TATCTTGAAG ATAAATATGA jGTTTATGACT T.CAGAACACC AGTTCATATC ACGAAAGGAT 2220 

GAAGGAGATA GGAT GATTGt ATTTGAAAAA GGAAACCTAG TTTTTGTCTT TAATTTTCAC 2280 

TGGACAAAAA GCTATTCAGA CTATCGCATA GGCTGCCTGA AGCCTGGAAA ATACAAGGTT 2340 

GCCTTGGACT CAGATGATCC ACTTTTTGGT GGCTTCGGGA GAATTGATCA TAATGCCGAA 2400 

TATTTCACCT TTGAAGGATG GTATGATGAT CGTCCTCGTT CAATTATGGT GTATGCACCT 2460 

TGTAGAACAG CAGTGGTCTA TGCACTAGTA GACAAAGAAG AAGAAGAAGA AGAAGAAGAA 2520 

GAAGAAGTAG CAGTAGTAGA AGAAGTAGTA GTAGAAGAAG AATGAACGAA CTTGTG 2576 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GGATGCTAAT GTTTCTGTAT TCTTGAAAAA GCACTCTCTT TCACGGAAGA TCTTGGCTGA 60 

AAAGTCTTCT TACAATTCCG AATCCCGACC TTCTACAGTT GCAGCATCGG GGAAAGTCCT 120 

TGTGCCTGGA AYCCAGAGTG ATAGCTCCTC ATCCTCAACA GACCAATTTG AGTTCACTGA 180 

GACATCTCCA GAAAATTCCC CAGCATCAAC TGATGTAGAT AGTTCAACAA TGGAACACGC 240 

TAGCCAGATT AAAACTGAGA ACGATGACGT TGAGCCGTCA AGTGATCTTA CAGGAAGTGT 300 

TGAAGAGCTG GATTTTGCTT CATCACTACA ACTACAAGAA GGTGGTAAAC TGGAGGAGTC 360 

TAAAACATTA AATACTTCTG AAGAGACAAT TATTGATGAA TCTGATAGGA TCAGAGAGAG 420 

GGGCATCCCT CCACCTGGAC TTGGTCAGAA GATTTATGAA ATAGACCCCC TTTTGACAAA 480 

CTATCGTCAA CACCTTGATT ACAGGTATTC ACAGTACAAG AAACTGAGGG AGGCAATTGA 540 

CAAGTATGAG GGTGGTTTGG AAGCTTTTTC TCGTGGTTAT GAAAAAATGG GTTTCACTCG 600 

TAGTGCTACA GGTATCACTT ACCGTGAGTG GGCTCCTGGT GCCCAGTCAG CTGCCCTCAT 660 

TGGAGATTTC AACAATTGGG ACGCAAATGC TGACATTATG ACTCGGAATG AATTTGGTGT 720 

CTGGGAGATT TTTCTGCCAA ATAATGTGGA TGGTTCTCCT GCAATTCCTC ATGGGTCCAG 780 
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AACAGCAGTG GTCTATGCAC TAGTAGACAA ANTAG.AAGNA GAAGAAGAAG AAGAANCCGN 2520 

NGAAGAATT 2529 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GATTTAATAC GACTCACTAT AGGGATTTTT 1 1 It M I i I I TTTTAAAAAC CTCCTCCACT 60 

CAGTCTTGGG ATCTCTCTCT CTCTTCACGC TTCTCTTGGG GCCTTGAACT CAGCAATTTG 120 

ACACTCAGTT AGTTACACTC CTATCACTCA TCAGATCTCT ATTTTTTCTC TTAATTCCAA 180 

CCAAGGAATG AATTAAAAGA TTAGATTTGA AGGAGAGAAG AAGAAAGATG GTGTATACAC 240 

TCTCTGGAGT TCGTTTTCCT ACTGTTCCAT CAGTGTACAA ATCTAATGGA TTCAGCAGTA 300 

ATGGTGATCG GAGGAATGCT AATGTTTCTG TATTCTTGAA AAAGCACTCT CTTTCACGGA 360 

AGATCTTGGC TGAAAAGTCT TCTTACGATT CCGAATCCCG ACCTTCTACA GTTGCAGCAT 420 

CGGGGAAAGT CCTTGTACCT GGAATCCAGA GTGATAGCTC CTCATCCTCA ACAGACCAAT 480 

TTGAGTTCAC TGAGACAGCT CCAGAAAATT CCCCAGCATC AACTGATGTG GATAGTTCAA 540 

CAATGGAACA CGCTAGCCAG ATTAAAACTG AGAACGATGA CGTTGAGCCG TCAAGTGATC 600 

TTACAGGAAG TGTTGAAGAG TTGGATTTTG CTTCATCACT ACAACTACAA GAAGGTGGTA 660 

AACTGGAGGA GTCTAAAACA TTAAATACTT CTGAAGAGAC AATTATTGAT GAATCTGATA 720 

GGATCAGAGA GAGGGGCATC CCTCCACCTG GACTTGGTCA GAAGATTTAT GAAATAGACC 780 

CCCTTTTGAC AAACTATCGT CAACACCTTG ATTACAGGTA TTCACAGTAC AAGAAAATGA 840 

GGGAGGCAAT TGACAAGTAT GAGGGTGGTT TGGAAGCTTT TTCTCGTGGT TATGAAAAAA 900 

TGGGTTTCAC TCGTAGTGCT ACAGGTATCA CTTACCGTGA GTGGGCTCCT GGTGCCCAGT 960 

CAGCTGCTCT CATTGGAGAT TTCAACAATT GGGACGCAAA TGCTGACATT ATGACTCGGA 1020 

ATGAATTTGG TGTCTGGGAG ATTTTTCTGC CAAATAATGT GGATGGTTCT CCTGCAATTC 1080 

CTCATGGGTC CAGAGTGAAG ATACGCATGG ACACTTCATC AGGTGTTAAG GATTCCATTC 1140 

CTGCTTGGAT CAACTACTCT TTACAGCTTC CTGATGAAAT TCCATATAAT GGAATATATT 1200 

ATGATCCACC CGAAGAGGAG AGGTATGTCT TCCAACACCC ACGGCCAAAG AAACCAAAGT 1260 
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GCGGAATTTC ATGTGACAAA AGGTTTGCAA TTCTTTCCAC TATTAGTAGT GCAACGATAT 3000 

ACGCAGAGAT GAAGTGCTGA ACAAACATAT GTAAAATCGA TGAATTTATG TCGAATGCTG 3060 

GGACGGGCTT CAGCAGGTTT TGCTTAGTGA GTTCTGTAAA TTGTCATCTC TTTANATGTA 3120 

GAGCCCACTA GAAATCAATT ATGTGAGACC TAAAAAACAA TAACCATAAA ATGGAAATAG 3180 

TGCTGATCTA ATGATGTTTT AANCCNNNNA AAAAAAAAAA AAAAACTCGA G 3231 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



TCATTAAAGA 


GGAGAAATTA 


ACTATGAGAG 


GATCTCACCA 


TCACCATCAC 


CATGGGATCT 


60 


TGGCTGAAAA 


GTCTTCTTAC 


AATTCCGAAT 


TCCGACCTTC 


TACAGTTGCA 


GCATCGGGGA 


120 


AAGTCCTTGT 


GCCTGGAACC 


CAGAGTGATA 


GCTCCTCATC 


CTCAACAAAC 


CAATTTGAGT 


180 


TCACTGAGAC 


ATCTCCAGAA 


AATTCCCCAG 


CATCAACTGA 


TGTAGATAGT 


TCAACAATGG 


240 


AACACGCTAG 


CCAGATTAAA 


ACTGAGAACG 


ATGACGTTGA 


GCCGTCAAGT 


GATCTTACAG 


300 


GAAGTGTTGA 


AGAGCTGGAT 


TTTGCTTCAT 


CACTACAACT 


ACAAGAAGGT 


GGTAAACTGG 


360 


AGGAGTCTAA 


AACATTAAAT 


ACTTCTGAAG 


AGACAATTAT 


TGATGAATCT 


GATAGGATCA 


420 


GAGAGAGGGG 


CATCCCTCCA 


CCTGGACTTG 


GTCAGAAGAT 


TTATGAAATA 


GACCCCCTTT 


480 


TGACAAACTA 


TCGTCAACAC 


CTTGATTACA 


GGTATTCACA 


GTACAAGAAA 


CTGAGGGAGG 


540 


CAATTGACAA 


GTATGAGGGT 


GGTTTGGAAG 


CI 1 1 1 ICTCG 


TGGTTATGAA 


AAAATGGGTT 


600 


TCACTCGTAG 


TGCTACAGGT 


ATCACTTACC 


GTGAGTGGGC 


TCCTGGTGCC 


CAGTCAGCTG 


660 


CCCTCATTGG 


AGATTTCAAC 


AATTGGGACG 


CAAATGCTGA 


CATTATGACT 


CGGAATGAAT 


720 


TTGGTGTCTG 


GGAGAI 1 1 1 1 


CTGCCAAATA 


ATGTGGATGG 


TTCTCCTGCA 


ATTCCTCATG 


780 


GGTCCAGAGT 


GAAGATACGT 


ATGGACACTC 


CATCAGGTGT 


TAAGGATTCC 


ATTCCTGCTT 


840 


GGATCAACTA 


CTCTTCACAG 


CTTCCTGATG 


AAATTCCATA 


TAATGGAATA 


TATTATGATC 


900 


CACCCGAAGA 


GGAGAGGTAT 


ATCTTCCAAC 


ACCCACGGCC 


AAAGAAACCA 


AAGTCGCTGA 


960 


GAATATATGA 


ATCTCATATT 


GGAATGAGTA 


GTCCGGAGCC 


TAAAATTAAC 


TCATACGTGA 


1020 


AIM IAGAGA 


TGAAGTTCTT 


CCTCGCATAA 


AAAAGCTTGG 


GTACAATGCG 


GTGCAAATTA 


1080 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AATTTYATGG GNAAYGARTT YGG 
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9. Starch which as extracted from a potato plant by wet milling at ambient temperature 
has a pasting viscosity in the range 214 - 434 SNUs, as judged by viscoamylograph 
conducted according to the protocol defined in claim 1 . 

10. Starch which as extracted from a potato plant by wet milling at ambient temperature 
has a set-back viscosity in the range 450 - 618 SNUs, as judged by viscoamylograph 
conducted according to the protocol defined in claim 7. 

11. Starch which as extracted from a potato plant by wet milling at ambient temperature 
has a set-back viscosity in the range 14 - 192 SNUs, as judged by viscoamylograph 
conducted according to the protocol defined in claim 7. 

12. Starch which as extracted from a potato plant by wet milling at ambient temperature 
has a peak viscosity in the range 200 - 500 SNUs and a set-back viscosity in the range 
275-618 SNUs as judged by viscoamylograph according to the protocol defined in claim 
7. 

13. Starch which as extracted from a potato plant by wet milling at ambient temperature 
has a viscosity which does not decrease between the start of the heating phase (step 2) and 
the start of the final holding phase (step 5) and has a set-back viscosity of 303 SNUs or 
less as judged by viscoamylograph according to the protocol defined in claim 7. 

14. Starch which as extracted from a potato plant by wet milling at ambient temperature 
displays no significant increase in viscosity as judged by viscoamylograph conducted 
according to the protocol defined in claim 7. 

15. Starch which as extracted from a potato plant by wet milling at ambient temperature, 
is in accordance with claim 7 and in accordance with any one of claims 8 to 14. 



16. Starch according to any one of claims 7 to 15, having an amylose content in the range 
35 - 66%, as judged by the method of Morrison & Laignelet defined in claim 1. 
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enzyme (SBE) obtainable from potato plants. 
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28. A nucleotide sequence according to claim 27 . encoding a polypeptide comprising 
substantially the amino acid sequence of residues 49 to 882 of the sequence shown in 
Figure 5. 

29. A nucleotide sequence according to claim 27 or 28 , comprising substantially the 
sequence of nucleotides 289 to 2790 of the sequence shown in Figure 5, or a functional 
equivalent thereof. 

30. A nucleotide sequence according to claim 29, further comprising the sequence of 
nucleotides 145 to 288 of the sequence shown in Figure 5, or a functional equivalent 
thereof. 

31 . A nucleotide sequence according to claim 27, comprising the sequence of nucleotides 
228 to 2855 of the sequence labelled psbe2con,seq in Figure 8, or a functional equivalent 
thereof. 

32. A nucleotide sequence according to claim 27, comprising the sequence of nucleotides 
57 to 2564 of the sequence labelled as psbe2con.seq in Figure 12, or a functional 
equivalent thereof. 

33. A nucleotide sequence according to any one of claims 27 to 32, comprising an in- 
frame ATG start codon, and optionally including a 5' and/or a 3' untranslated region. 

34. A nucleotide sequence according to claim 27, comprising the sequence of nucleotides 
45 to 3200 of the sequence labelled as psbe2con.seq in Figure 8, or a functional equivalent 
thereof. 

35. A nucleic acid construct comprising a sequence in accordance with any one of claims 
27 to 34. 
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introducing into the plant one or more further sequences. 
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46. A method according to claim 45, wherein one or more of the further sequences are 
operably linked in the anti-sense orientation to a suitable promoter active in the plant. 

47. A method according to claim 45 or 46 , wherein the further sequence comprises a 
portion of a class B SBE nucleotide sequence. 

48. A method according to any one of claims 42 to 47, effective in altering the starch 
composition of a plant. 

49. A plant or plant cell having characteristics altered by the method of any one of claims 
42 to 48, or the progeny of such a plant, or part of such a plant. 

50. A plant according to claim 49, selected from one of the following: potato, pea, 
tomato, maize, wheat, rice, barley, sweet potato, and cassava. 

51. A tuber or other storage organ from a plant according to claim 49 or 50. 

52. Use of a tuber or other storage organ according to claim 51, in the preparation and/or 
processing of a foodstuff, 

53. A plant according to claim 49 or 50, containing starch which, as extracted from the 
plant by wet milling at ambient temperature, has an elevated viscosity onset temperature 
as judged by viscoamylograph conducted according to the protocol defined in claim 7, 
compared to starch extracted from a similar, but unaltered, plant. 

54. A plant according to claim 53, wherein the viscosity onset temperature is elevated by 
an amount in the range of 10 to 25°C. 

55. A plant according to claim 49 or 50, containing starch which, as extracted from the 
plant by wet milling at ambient temperature, has a decreased peak viscosity as judged by 
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63. A plant according to claim 49 or 50, containing starch which, as extracted from the 
plant, has a phosphorus content in excess of 200mg/100grams dry weight starch. 

64. Starch obtainable from a plant according to any one of claims 49, 50 or 53 - 63. 

65. Starch according to claim 64 and further in accordance with any one of claims 1 - 22. 

66. A method of modifying starch in vitro, comprising treating starch under suitable 
conditions with an effective amount of a polypeptide in accordance with any one of claims 
38 to 41. 



67. A potato plant or part thereof which, in its wild type possesses an effective SBE A 
gene, but which plant has been altered such that there is no effective expression of an SBE 
A polypeptide within the cells of at least part of the plant. 

68. A potato plant according to claim 67, wherein the alteration is effected by a method 
according to any one of claims 42-48. 
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TTGATGGGGCCTTGAACTCAGCAATTTGACACTCAGTTAGTTACA^ 
I I 1 H 1 1 | f- 

AACTACCCCGGAACTTGAGTCGTTAAACTGTGAGTCAATCAATGT 



AAGGAATGAATAAAAGGATAGATTTGTAAAAACCCTAAGGAGAGA 
1 : — I — ' — m — — ] — h h~ — i 1 

TTCCTTACTTATTTTCCTATCTAAACATTTTTGGGATTCCTCTCT 



M N K 



I D 



GTTCCATCAGTGTACAAATCTAATGGATTCAGCAGTAATGGTGAT 
1 r — I— 1 H h~ 1 ■ ... i 1 H 

CAAGGTAGTCACATGTTTAGATTACCTAAGTCGTCATTACCACTA 
VPSVYKSNG'FSSNGD 

: B 9' 11 ; EcoR I 

TCACGGAAGATCTTGGCTGAAAAGTCTTCTTACAATTCCGAATTC 
1 —4—- — H — ■ — i ' — I t~ 1 

AGTGCCTTCTAGAACCGACTTTTCAGAAGAATGTTAAGGCTTAAG 
SRKILAEKSSYNSEF 

ACCCAGAGTGATAGCTCCTCATCCTCAACAGACCAATTTGAGTTC 
' H- 1 1 =-h 1 1- — h 

TGGGTCTC ACTATCGAGGAGTAGGAGTTGTCTGGTTAAACTCAAG 
TQSDSSSS ST DQ.FEF 

AGTTCAACAATGGAACACGCTAGCCAGATTAAAACTGAGAACGAT 
• H 1 H -h- 1 1 1 «— t- 

TCAAGTTGTTACCTTGTGCGATCGGTCTAATTTTGACTCTTGCTA 
SST MEHASQ1KTENQ 

GATTTTGCTTCATCACTACAACTACAAGAAGGTGGTAAACTGGAG 
" H — 1 — i 1 1 1 — 1 1 i 1 ' ' — | — — +- 

CTAAAACGAAGTAGTGATGTTGATGTTCTTCCACCATTTGACCTC 
D FASSLQLQEGGKL EJ 



► Fig 5 
Sheet 2 
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iCTGATAGGATCAGAGAGAGGGGCATCCCTCCACCTGGACTTGGT 
' 1 1 1 I 1 H — — i ~f_ — H 

AGACTATCCTAGTCTCTCTCCCCGTAGGGAGGTGGACCTGAACCA 
S0R I RERGIPPPGLG 

CACCTTGATTACAGGTATTCACAGTACAAGAAACTGAGGGAGGCA 

1 ' ' ' 1 • ' 1 H ■ | ■ h 

GTGGAACTAATGTCCATAAGTGTC ATGTTCTTTGACTCCCTCCGT 
HLDy RYSQYKKLREA 



GAAAAAATGGGTTTCACTCGTAGTGCTACAGGTATCACTTACCGT 

1 h— H 1 -H 1 , 1 , . ,. 

CTTTTTTACCCAAAGTGAGCATCACGATGTCCATAGTGAATGGCA 
EKM G FTRSATG I T Y R 

AACAA TTGGGACGCAAATGCTGAC ATTATGACTCGGAATGAATTT 
1 " ' 1 1 : — H — ( — . 1 

TTGTTAACCCTGCGTTTACGACTGTAATACTGAGCCTTAGTTAAA 

NNWDAN AD I MTRNEF 

GCAATTCCTCATGGGTCCAGAGTGAAGATACGTATGGACACTCCA 
1 H : 1 ' I ' 1 1 | 1- 

CGTTAAGGAGTACCCAGGTCTCACTTCTATGCATACCTGTGAGGT 
AIp HGS R VKIRMDTpJ 
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CAGCTTCCTGATGAAATTCCATATAATGGAATACATTATGATCCA^ 

H 1 t— H h- 1 — H , 1 , h 

GTCGAAGG ACT AC TJ.TAAGGTATATTACCTT AT G T A AJ^A G T A G G T 
Q L P D E I P YNG I HYD P 

CCAAAGTCGCTGAGAATATATGAATCTCATATTGGAATGAGTAGT 
" — H H— — — ■ ■ | 1 — — >-+• 

GGTTTCAGCGACTCTTATATACTTAGAGTATAACCTTACTCATCA 
PKS LRIYESHIGMSS 

: HinD III 

CTTCCTCGCATAAAAAAGCTTGGGTACAATGCGCTGCAAATTATG 
' K -t H — > — ^ H= h~ — h — — *- 

GAAGGAGCGTATTTTTTCGAACCCATGTTACGCGACGTTTAATAC 
LPRIK'KL. GYNALQIM 

ACAAATTTTTTTGCACCAAGCAGC.CGTTTTGGAACGCCCGACGAC 

i 1 [ 1 I 1 '' | I ! | 

TGTTTAAAAAAACGTGGTTCGTCGGCAAAACCTTGCGGGCTGCTG 
TNFFAPSS R F GTPDD 

v> C TC ATGGACATTGTTCACAGCCATGCATCAAATAATACTTTAGAT 
1 1 — +- H 1 1 — ) — — < — H — ' — 

GAGTACCTGTAACAAGTGTCGGTACGTAGTTTATTATGAAATCTA 
LMDI VHS HASNNT L D^ 
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Sac I . - . jc-,*U. ■ - ■ -.- 

GGAGCTCGTGGTTATCATTGGATGTGGGATTCCCGCCTCTTTAAC 

1 1 ... i i ... | i • ■ 1 1 1 i 1 ' 

CCTCGAGCACCAATAGTAACCTACACCCTAAGGGCGGAGAAATTG 

G A R G Y H W M W D S R L F N 



TGGTGGTTGGATGCGTTCAAATTTGATGGATTTAGATTTGATGGT 

, 1_ — | i H — ■ 1 ' 1 1 1 — 1 — 

ACCACCAACCTACGCAAGTTTAAACTACCTAAATCTAAACTACCA 

WWLDA F KFDGFR FDG 



ACTGGGAACTACGAGGAATACTTTGGACTCGCAACTGATGTGGAT p ig 5 

< 1 1 1 1 1 1 ,1,11 r Sheet 8 

TGACCCTTGATGCTCCTTATGAAACCTGAGCGTTGACTAC ACCT A 

/F'EYFGLATDVD 



T 



N 



TTCCCAGATGCAATTACCATTGGTGAAGATGTTAGCGGAATGCCG 

, 1 — 1 ■ 1 I ' 1 1 11 1 

AAGGGTCTACGTTAATGGTAACCACTTCTACAATCGGCTTACGGC 

GEDVSGMP 



D 



I 



1 



CGGCTGCATATGGCAATTGCTGATAAACGGATTGAGTTGCTCAAG 

^ — — 1 I 1 1 I .' ] 1 1 I ' I 

GCCGACGTATACCGTTAACGACTATTTGCCTAACTCAACGAGTTC 

R I E L L K 



L H M 



I 



D K 



AC AAAT AGAAGATGGTCGGAAAAGTGTGTTTC AT ACGCTGAAAGT 

, 1 1 1 1 1 — < 1 — ' 1 1- 

TGTTTATCTTCTACCAGCCTTTTCACACAAAGTATGCGACTTTCA 

TNRRWSEKCVSYAESV 
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.Hinc II 



ATGGACAAGGATATGTATGATTTTATGGCTCTGGATAGACCGTCA^ 

1 1 — > i 1 h 1 ... . . i — ■ — ^ 1- 

TACCTGTTCCTATACATACTAAAATACCGAGACCTATCTGGCAGT 
MDKDMYDFMALDRPS 



Asp 718 
; Kpnl 

CTTGTAACTATGGGATTAGGAGGAGAAGGGTACCTAAATTTCATG 

1 1 1 I 1 1 . . . . i 1 

GAACATTGATACCCTAATCCTCCTCTTCCCATGGATTTAAAGTAC 
-VTM.GLGGEGYL NFM 

GAACAACACCTCTCTGATGGCTCAGTAATCCCCGGAAACCAATTC F : rt c 
. 1 1 f— 1 1 ~h 1 h V£ig.5 

CTTGTTGTGGAGAGACTACCGAGTCATTAGGGGCGTTTGGTTAAG ^W^O 

EQHLSDGSVIPGNQF 

Ssp I 

TATTTAAGATACCGTGGGTTGCAAGAATTTGACCGGCCTATGCAG 

— 1 1 i . i i . l . . i i | | i i . i 

ATAAATTCTATGGCACCCAACGTTCTTAAACTGGCCGGATACGTC 
YLRYRG LQEFDRPMQ 

ATATCACGAAAGGATGAA.GGAGATAGGATGATTGTATTTGAAAAA 
1 1 1 1 1 1 1 1 h 

T ATAGTGCTTTCCT ACTTCCTCTATCCTACTAACATAAACTTTTT 

I S R K 



D E 



M I 



K 



TCAGACTATCGCATAGCCTGCCTGAAGCCTGGAAAATACAAGGTT 
1 r- 1— 1 1 ; — I I 1 ... I 

AGTCTGATAGCGTATCGGACGGACTTCGGACCTTTTATGTTCCAA 
S 0 Y R 



I 



K 



K 



V J 
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Ssp 



GATCATAATGCCGAATATTTCACCTTTGAAGGATGGTATGATGAT^ 

*- — ■ ■ i | ■ 1 1 i — ■ j ■ ... < 

CTAGTATTACGGCTTATAAAGTGGAAACTTCCTACCATACTACTA 
D HNAEY FTFEGWYDD 

GTCTATGCACTAGTAGACAAAGAAGAAGAAGAAGAAGAAGAAGAA 
1 1 — ' I h H -t- 1— — h 

CAGATACGTGATCATCTGTTTCTTCTTCTTCTTCTTCTTCTTCTT 
v Y AL V D K E E EEEEEE 



TGAACGAACTTGTGATCGCGTTGAAAGATTTGAACGCTACATAGA 
1 1 < — H ' — i ' — I h« — > — I ~+ 

ACTTGCTTGAACACTAGCGCAACTTTCTAAACTTGCGATGTATCT 



TCATGTGACACAAGGTTTGCAATTCTTTCCACTATTAGTAGTGCA 
1 1 ' ' — ■ — H ~h 1 1 1 h 

AGTACACTGTGTTCCAAACGTTAAGAAAGGTGATAATCATCACGT 

: EcoR I Pst I 

GATGAATTTATGTCGAATGCTGGGACGATCGAATTCCTGCAGGCC 
1 1— 1 1 l 1 1 ■—♦—+- 

CTAC TT AAATACAGCTTACGACCC TGCTAGC TTAAGGACGTCCGG 
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f180 ^190 f200 *2)0 *220 

I YE I DPLLTNYRQHLDYRYSQYKKLREA I DKYEGGLEAFSRG YEKMGFTR 
: : : DP L. Y : H: . R . : Y . : I: KYEG LE. F: : GY K. GF. R 
LLNLDPTLEPYLDHFRHRMKRY VDQKMLI EKYEGPLEEFAQG YLKFGFNR 

*-100 ^110 *120 *-130 ^140 

f230 ^250 ^260 ^270 

SATG I TYREWALGAQSAAL I GDFNNWDANAD I MTRNEFGVWE I FLPNNVD 
. . . I. YREWA : AQ. A. : IGDFN. W: : : . : : M. : : : FGVW. I : P: VD 
EDGC I VYREWAPAAQEAEV IGDFNGWNGSNHMMEKDQFGVWS IR IPD-VD 

*-150 *-160 ^170 ^180 A 190 

*-280 *-290 ^300 ^310 ^320 

GSPAIPHGSRVK I RMDTPSGV-KDS I PAW I NYSLQLPDE I --PYNGIHYD 
: . P. I PH. SRVK: R. . : GV D. I PAW I : Y: . PY: G: . . D 

SKPV I PHNSRVKFRFKHGNGVWVDR I P AW IKYATADATKFAAPYDGVYWD 

^200 *-210 *-220 *-230 *-240 

^330 ^340 ^350 ^360 ^370 

PPEEERY I FQHPRPKKPKSLRI YESH I GMSSPEPK 1 NSY VNFRDEVLPR 1 
PP . ERY F: . PRP KP: : R I YE: H: GMSS: EP: : NSY : F D: VLPRI 
PPPSERYHFKYPRPPKPRAPR I YEAHVGMSSSEPRVNSYREFADDVLPR I 

*-250 ^-260 ^270 ^280 ^290 

>r380 >r390 f-400 ^10 *-420 

KKLGYNALQIMAI QEHS Y YASFGYH VTNFFAPSSRFGTPDDLKSL I DKAH 
K . YN: : Q: MAI EHSYY: SFGYHVTNFFA S: R: G. P: DLK LIDKAH 
KANN YNT VQLMA I MEHS YY GSFGY HVTNFFAV SNRYGNP EDLK Y L I DKAH 

*\300 *-310 ^320 *\330 *340 

*-430 >f440 >450 f460 ^470 

ELG I VVLMD I VHSHASNNTLDGLNMFDC TDSC YFHSGARGYHWMWDS 

. LG: VL: D: VHSHASNN. DGLN FD : : . . YFH: G. RGYH : WDS 
SLGLQVL VD VVHSHASNNVTDGLNGFD I GQGSQESYFHAGERGYHKLWDS 

^350 *360 *-370 *-380 ^390 

f480 ^490 ^500 <r510 ^520 

RLFN YGNWE VLR YLLSNARWWLDAFKFDGFRFDGVTSMMY I HHGLSVGFT 
RLFNY: NWEVLR: LLSN RWWL: . : : FDGFRFDG: TSM: Y: HHG: : : GFT 
RLFN YANWE VLRFLLSNLRWWLEEYNFDGFRFDG I TSML YVHHG I NMGFT 

*-400 A 410 H20 *-430 *440 

>r530 ^-540 ^550 *-560 ^570 

GNYEEYFGLATDVDAVV YLMLVNDL IHGLFPDAI T I GEDVSGMPTFC I PV 
GNY: EYF: ATDVDA VVYLML. N: LIH : FPDA. . I: EDVSGMP. : . PV 
GN YNEYF SE ATD VD AV V YLMLANNL I HK I FPD AT V I AED VSGMPGLSRPV 

*-450 *-460 *470 *-480 *-490 

*-580 *-590 f-600 ^610 *-620 

QEGGVGFDYRLHMA 1 ADKR I ELLK -KRDEDWR VGD i VHTLTNRRWSEKCV 
EGG: GFDYRL MAI:DK:I: LK K. DEDW. : ::. : LTNRR. : EKC: 
SEGG I GFDYRLAMA 1 PDKW I DYLKNKNDEDWSMKEVTSSLTNRR YTEKC I 

*-500 *510 *520 *-530 *540 
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HO *-20 >r30 ,f40 

MV YTLSG VRFPT VPSVYKSNGFSSNGDRRNAN VSVFLKKH--SLSRK I LA 
MVYT: SG: RFP. : PS: . KS : . DRR. : : S FLK: : S: SR L 
MVYTISG IRFPVLPSLHKS TLRCDRR ASSHSFFLKNNSSSFSRTSLY 

^10 *-20 *\30 ^40 

f50 ^60 ?70 ^80 ^90 

EKSSYNSEFRPSTVAASGKVLVPGTQSDSSSSSTDQFEFTETSPENSPAS 
. K S : SE : : ST: A. S: KVL: P. . Q D: S S : DQ: E . : . : : E: : 
AKFSRDSETKSST I AESDK VL I PEDQ- DNSVSLADQLENPD I TSEDAQNL 
*50 *60 *70 *-80 *-90 
^100 >r1 10 *-120 ^130 ^140 
TDVDSSTMEHASQ I KTENDDVEPSSDLTGSVEELDFASSLQLQEGGKLEE 
. Di TM. ; i i i \,i i * • • * * • s 

E D L J M K D G N K Y N I D - E S T S S Y R E V G D E K G S V T S S S L V D V N T D T Q - - A 
^100 ^1 10 ^120 ^130 * 

>r150 ^160 ^1 70 ^180 ^190 

SKTLNTSEETI I DESDR I RERG I PPPGLGQK I YE I DPLLTNYRQHLD YRY 
• KT S: . . : :. :I IPPPG GQKIYEIDPLL . . RQHLD: RY 

KKTS VHSDKKVK VDKPK I I PPPGSGQK I YE I DPLLQAHRQHLDFRY 

^150 ^160 *170 *180 

^200 *-210 ^220 ^230 ^240 

SQYKKLREA I DK YEGGLEAFSRGYEKMGFTRSATG ITYREWALGAQSAAL 
: QYK: : RE. IDKYEGGL: AFSRGYEK. GFTRSATG I TYREW: GA: SAAL 
GQYKRIREEIDKYEGGLDAFSRGYEKFGFTRSATGITYREWGPGAKSAAL 
^190 *200 *-210 ^220 ^230 

*-250 ^260 ^270 ^280 *-290 

1 GDFNNWDANAD I MTRNEFGVWE I FLPNN VDG SPA IPHGSRVK I RMDT PS 
: GDFNNW: : NAD: MT: : . FGVWE I FLPNN. DGSP: IPHGSRVKI: MDTPS 
VGDFNNWNPNAD VMTKDAFGVWE I FLPNN ADGSPP I PHGSRVK I HMDTPS 
^240 ^250 *260 ^270 *-280 

^300 ^310 f-320 *-330 ^340 

GVKDS I PAW I NYSLQLPDE I PYNG I HYDPPEEER Y IFQHPRPKKPKSLRI 
G: KDSIPAW1: : S: Q P: E 1 PYNG I . YDPPEEE: Y: F: HP: PK: P: S: RI 
GI KDSIPAW IKFSVQAPGE I PYNG I YYDPPEEEK YVFKHPQPKRPQS IRI 
*290 *300 *310 ^320 ^330 

^350 ^360 f-370 ^380 r390 

YESH I GMSSPEPK I NSY VNFRDEVLPR I KKLGYNALQ IMAIQEHSYYASF 
YESH I GMSSPEPK I N: Y. NFRD: VLPR I KKLGYNA: Q I MA I QEHSYYASF 
YESH IGMSSPEPKIN'TYANFRDD.VLPRIKKLGYNAVQ IMAIQEHSYYASF 
^340 *-350 *-360 ^370 ^380 

<r400 >r410 f420 ^430 ^440 

GYHVTNFFAPSSRFGTPDDLKSL I DKAHELG I VVLMD I VHSHASNNTLDG 
GYHVTNFFAPSSRFGTP: DLKSLID: AHELG: : VLMDIVHSH: SNNTLDG 
GYHVTNFFAPSSRFGTPEDLKSL I DRAHELGLLV LMD I VHSHSSNNTLDG 
*-390 ^400 *410 ^420 *430 
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^AACOTGtCCAforC^GtC 



72 tttctcttaattccaaccaBgcBaatgaataaaaggat-a 

73 tttctcttaattccaaccaagg-aatgaataaaaggat-a 
71 tttctcttaattccaaccaagg-aatgaataaaa0gat-a 

165 TTTCTCTTAATTCCAACCAAGG-AATGAATfiAAAEjGATijA 

191 TGTAC A A ATCTA ATG G ATTCA G C A GTA ATG GTGATC G G AG 

191 TGTA C A A ATCTA ATG G ATTCAG C A GTA ATG GTGATC G G AG 

189 TGTA CA A ATCTA ATG G ATTCAG C A GTA ATG GTGATC G G A G 

274 TGTA C A A ATCTA ATG G ATTCA G C A GTAATG GTGAT C G G A G 

311 A ATTC C G A C CTTCTA C A GTTG C A G CATC G G G G A A A GT C CT 

311 AATTCCGACCTTCTACAGTTGCAGCATCGGGGAAAGTCCT 

309 aatBccgaccttctacaSttgcagcatcggggaaagtcct 

394 aatgccgaccttctacagttgcagcatcggggaaagtcct 

431 cagcatcaactgatgtagatagttcaacaatggaacacgc 

431 cagcatcaactgatgtagatagttcaacaatggaacacgc 

429 cagcatcaactgatgtagatagttcaacaatggaacacgc 

514 cagcatcaactgatgtEgatagttcaacaatggaacacgc 

551 c atca cta c a a ct a c a a g a a g gtg gta a a ctg g a g g a gtc 

551 c atcacta caa cta ca a g a a g gtg gta a actg g a gg a gtc 

549 catcactacaactacaagaaggtggtaaactggaggagtc 

634 catc acta caa cta caa g aa g gtg gta aa ctg g a gga gtc 



671 TTGGTCAGAAGATTTATGAAATAGACCCCCTTT 

671 TTGGTCAGAAGA 

669 TTGGTCAGAAGA 

754 TTGGTCAGAAGA 



TGACAAA 

ATGAAATAGACCCCCTTTTGACAAA 
ATGAAATAGACCCCCTTTTGACAAA 
ATGAAATAGACCCCCTTTTGACAAA 



791 aagqjttttctcgtggttatgaaaaaatgggtttcactcg 

791 aagc Btttt ctcgtggttatgaaaaaatgggtttcactcg 

789 aagctttttctcgtggttatgaaaEaatgggtttcactcg 

874 AAGCT 



TTTCTCGTGGTTATGAAAAAATGGGTT 
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ACTCCTATCACTTATCAGATCTCTATTT llcon.seq 
ACTCCTATCACTTATCAGATCTCTATTT 19con . seq 
ACT@cBaTCACTTATCAGATCTCTATTT 10con.seq 
ACTCCTATCACrBATCAGATCTCTATTT psbe2con , seq 

GGAGTTCGTTTTCCTACTGTTCCATCAG llcon .seq 
GGAGTTCGTTTTCCTACTGTTCCATCAG 19con.seq 
GGAGTTCGTTTTCCTACTGTTCCATCAG 10con . seq 
GGAGTTCGTTTTCCTACTGTTCCATCAG psbe2con . seq 

TTGGCTGAAAAGTCTTCTTACAATTCCG llcon . seq 
TTGGCTGAAAAGTCTTCTTACAATTCCG 19con.seq 
TTGGCTGAAAAGTCTTCTTACAATTCCG 10con . seq 
TTGGCTGAAAAGTCTTCTTACEaTTCCG psbe2con . seq 

TTCACTGAGACATCTCCAGAAAATTCCC llcon . seq 
TTCACTGAGACATCTCCAGAAAATTCCC 19con.seq 
TTC@CTGAGACATCTCCAGAAAATTCCC 10con.seq 
TTCACTGAGACaBcTCCAGAAAATTCCC psbe2con . seq 

GGAAGTGTTGAAGAGCTGGATTTTGCTT llcon . seq 

GGAAGTGTTGAAGAGCTGGATTTTGCTT 19con . seq 

GGAAGTGTTGAAGAGCTGGATTTTGCTT 10con . seq 

GGAAGTGTTGAAGACfiTGGATTTTGCTT psbe2con . seq 

AGAGAGAGGGGCATCCCTCCACCTGGAC llcon . seq 
AGAGAGAGGGGCATCCCTCCACCTGGAC 19con . seq 
AGAGAGAGGGGCATCCCTCCACCTGGAC 10con . seq 
AGAGAGAGGGGCATCCCTCCACCTGGAC psbe2con . seq 

GCAATTGACAAGTATGAGGGTGGTTTGG llcon.seq 
GCAATTGACAAGTATGAGGGTGGTTTGG 19con . seq 
GCAATTGACAAGTATGAGGGTGGTTTGG 10con .seq 
GCAATTGACAAGTATGAGGGTGGTTTGG psbe2con . seq 

GCCCTCATTGGAGATTTCAACAATTGGG llcon . seq 
GCCCTCATTGGAGATTTCAACAATTGGG 19con . seq 
GCCCTCATTGGGGATTTCAACAATTGGG 10con.seq 
G (jj CTCATTG G AGATTTCAA C A ATTGG G psbe2con.seq Fjg. 8 
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TGGGAGATTTTTCTGCCAAATAATGTGGATGGTTCTCCTGCAATTC 
TGGGAGATTTTTCTGCCAAATAATGTGGATGGTTCTCCTGCAATTC 
TCfiGAGATTTTTCTGCCAAATAATGTGGATGGTTCTCCTGCAATTC 
TGGGAGATTTTTCTGCCAAATAATGTGGATGGTTCTCCTGCAATTC 

TACTCTTTACAGCTTCCTGATGAAATTCCATATAATGGAATATATT 
TACTCTTTACAGCTTCCTGATGAAATTCGATATAATGGAATAgATT 
TACTCTTTACAGCTTCCTGATGAAATTCCATATAATGGAATATATT 
TACTCTTTACAGCTTCCTGATGAAATTCCATATAATGGAATATATT 

GAATCTCATATTGGAATGAGTAGTCCGGAGGCTAAAATTAACTCAT 
GAATCTCATATTGGAATGAGTAGTCCGGAGCCTAAAATTAACTCAT 
GAATCTCATATTGGAATGAGTAGTCCGGAGCCTAAAATTAAGTCAT 
GAATCTCATATTGGAATGAGTAGTCCGGAGCCTAAAATTAACTCAT 

t c a a g a g c attctt att atg cta gttttg gtt atc atgtc a c a a at 
tcaagagcattcttattaBgctagttttggttatcatgtcacaaat 
tc aag a g cattcttattatg ctagttttg gttatcatgtcac a aat 

TCAAGAGCATTCTTATTATGCTAGTTTTGGTTATCATGTCACAAAT 

aattgttgttctcatggacatSgttcacagccatgcatcaaataat 
aattgttgttctcatggacattgttcacagccatgcatcaaataat 
aattgttgttctcatggacattgttcacagccatgcatcaaataat 
aattgttgttctcatggacattgttcacagccatgcatcaaataat 

gatgtgggatljjccgcctctttaactatggaaactgggaggtactt 
gatgtgggattcccgcctctttaactatggaaactgggaggtactt 
gatgtgggattBccgcctctttaactatggaaactgggaggtactt 
gatgtgggattcccgcctctttaactatggaaactgggaggtactt 

atcaatgatgtatactcaccacggattatcggtgggattcactggg 
atcaatgatgtataBtcaccacggattatcggtgggattcactggg 
atcaatgatgtBtactcaccacggattatcggtgggattcactggg 
atcaatgatgtatactcaccacggattatcggtgggattcactggg 

tcatBggcttttcccagatgcaattaccattggtgaagatgttagc 
tcatgggcttttcccagatgcaattaccattggtgaagatgttagc 
tcatgggcttttcccagatgcaattaccattggtgaagatgttagc 
tcatgggcttttcccagatgcaattaccattggtgaagatgttagc^ 



Fig. 8 
> Sheet 6 



Fig. 8 

SHEET 5 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 9634968A2> 



WO 96/34968 




PCT/GB96/01075 



30/75 



1868 GGGGGTGTTGGCTTTGACTATCGGCTGCATATGGCAATTGO 
1870 GGGGGTGTTGGCTTTGACTATCGGCTGCATATGGCAATTGC 

1869 GGGGGTGTTGGCTTTGACTATCGGCTGCATATGGCAATTGC 
1953 GGGGGTGTTGGCTTTGACTATCGGCTGCATATGGCAATTGC 

1988 AGATGGTCGGAAAAGTGTGTTTCATACGCTGAAAGTCATGA 

1990 agatggtcggaaaagtgtgtttcatacgctgaaagtcatga 

1989 agatggtcggaaaagtgtgtttcatacgctgaaagtcatga 
2073 agatggtcggaaaagtgtgtttcatacgctgaaagtcatga 

2108 ccgBcaacatcattaatagatcgtgggatagcattgcacaa 
2110 ccgtcaacatcattaatagatcgtgggatagcattgcacaa 

2109 ccgtcaacatcattaatagatcgtgggatagcatt0cacaa 
2193 ccgtcaacatcattaatagatcgtgggatagcattgcacaa 

2228 tggattgatttccctagggctgaeceacaccliitctgatgg 
2230 tggattgatttccctagggctgaacaacacctctctgatgg 

2229 tggattgatttccctagggctgaagaacacctctctgatgg 
2313 tggattgatttccctagggctgaacaacacctctctgatgg 

2348 taccbtgggttecaagaatttgaciigggctatgcagtatct 
2350 taccgtgggttgcaagaatttgaccggBctatgcagtatct 

2349 taccgtgggttgcaagaatttgaccgggctatgcagtatct 
2433 taccgtgggttgcaagaatttgaccgggctatgcagtatct 

2468 GAAaBaGGAAACCTAGTTTtEgTCTTTAATTTTCACTGGAC 
2470 GAAAAAGGAAACCTAGTTTTTGTCTTTAATTTTCACTGGAC 

2469 GAAAAAGGAAACCTAGTTTTTGTCTTTAATTTTCACTGGAC 
2553 GAAAAAGGAAACCTAG I I I I I GTCTTTAATTTTCACTGGAC 

2588 TTTGGTGGCTTCGGGAGAATTGATCATAATGCCGAATATTT 
2590 TTTGGTGGCTTCGGGAGAATTGATCATAATGCCGAATATTT 

2589 TTTGGTGGCTTCGGGAGAATTGATCATAATGCCGAATATTT 

2673 tttggtggcttcgggagaattgatcataatgccgaatBttt 

2708 ctagtagacaaaBjagaacBB^HB BHE 

2710 CTAGTAGACAAAGAAGAAGAAGAAGAAGAAGAA Gfi?^^^ 

2709 CTAGTAGACAAAGAAGAAGAAG AAGAAGAAGAAG- 

2793 CTAGTAGACAAAGAAGAAGAAGAAGAAGAAgBH 
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GTGGGTGATATTGTTCATACACTGACAAATAGA llcon . seq 
GTGGGTGATATTGTTCATACACTGACAAATAGA 19con . seq 
GTGGGTGATATTGTTCATACACTGACAAATAGA 10con . seq 
GTGGGTGATATTGTTCATACACTGACAAATAGA psbe2con.seq 

AAGGATATGTATGATTTTATGGCTCTGGATAGA llcon. seq 
AAGGATATGTATGATTTTATGGCTCTGGATAGA 19con . seq 
AAGGATATGTATGATTTTATGGCTCTGGATAGA 10con . seq 
AAGGATATGTATGATTTTATGGCTflTGGATAGA psbe2con . seq 

AATTTCATGGGAAATGAATTCGGCCACCCTGAG llcon . seq 
AATTTCATGGGAAATGAATTCGGCCACCCTGAG 19con . seq 
AATTTCATGGGAAATGAATTCGGCCACCCTGAG 10con . seq 
AATTTCATGGGAAATGAATTCGGCCACCCTGAG psbeZcon . seq 

AGATTTGACCTGGGAGATGCAGAATATTTAAGA llcon . seq 
AGATTTGACCTGGGAGATGCAGAATATTTAAGA 19con . seq 
A G ATTT G A C CTG G G A G ATG C A GA ATATTTA A G A 10con . seq 
AGATTTGACCTGGGAGATGCAGAATATTTAAGA psbe2con . seq 

CGAAAGGATGAAGGAGATAGGATGATTGTATTT llcon . seq 
CGAAAGGATGAAGGAGATAGGATGATTGTATTT 19con . seq 
CGAAAGGATGAAGGAGATAGGATGATTGTATTT 10con . seq 
CGAAAGGATGAAGGAGATAGGATGATTGTATTT psbe2con . seq 

TACAAGGTT(f|CTTGGACtCAGATGATCCACtT llcon . seq 
TACAAGGTTGCCTTGGACTCAGATGATCCACTT 19con . seq 
TACAAGGTTGCCTTGGACTCAGATGATCCACTT 10con.seq 
TACAAGGTTGCCTTGGACTCAGATGATCCACTT psbe2con . seq 

TATGCACCTAGTAGAACAGCAGTGGTCTATGCA llcon . seq 

tatgcacctUgtaEaacagcagtggtctatgca 19con . seq 

TATGCACCTA GTA G A A CAGCAGTGGTCTATGCA 10con . seq 
TATGCACCTAGTAGAACAGCAGTGGTCTATGCA psbe2con . seq 
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AACTTGTGATCGCGTTGAAAGATTTGAACCflfg llcon . seq 

AACTTGTGATCGCGTTGAAAGATTTGAACG 19con . seq 

AACTTGTGATCGCGTTGAAAGATTTGAACG 10con . seq 

AACTTGTGATCGCGTTGAAAGATTTGAACG-— psbeZcon . seq 
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^tcagtcttggcggaatt5 catgtgacaaSaaggtttgcaEtt 
tgcatcagtcttggcggaatttcatgtgacag-aaggtttgcaatt 
tgcatBagtcttggcggaatttcatgtgacaa-Saggtttgcaatt 

TGCATCAGTCTTGGCGGAATTTCATGTGACAA-AAGGTTTGCAATT 



TTTATGTCGAATGCTGGGACGATCGAATTCCTGCAGCC 
TTTATGTCGAATGCTGGGACGATCGAATTCCTGCAG 

TTTATGTCGAATGCTGGGACGATCGAATTCCTGCAGCC 
TTTATGTCGAATGCTGGGAC 



gTTTTGCTTAGTGA 



CATAAAATGGAAATAGTGCTGATCrAATGATGTTTTAAMCCNMMMA 
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GGATGCTAATG.TTTCTGT ATTCTTGAAAAAGCACTCTCTTTCACGG 
I 1 1 : — i 1 — ■ h --— H — J , , . | . . — ^ 

CCTACGATTACAAAGACATAAGAACTTTTTCGTGAGAGAAAGTGCC 
ANVSVF LKKHSLSR 

TTCTACAGTTGCAGCATCGGGGAAAGTCCTTGTGCCTGGAAYCCAG 
1 1 1 ' ■ i 1 1 I ' i ■ 1 ■ ■ ■ ■ i 1 1— 

AAGATGTCAACGTCGTAGCCCCTTTCAGGAACACGGACCTTRGGTC 
ST'VAASGKVLVP'G?Q 

GACATCTCCAGAAAATTCCCCAGCATCAACTGATG'TAGATAGTTCA 
-i : H 1 i . | ■ ■ 1 — H < 1 h— 

CTGTAGAGGTCTTTTAAGGGGTCGTAGTTGACTACATCTATCAAGT 
TSPENSPAST DVDSS 

TGAG.CCGTCAAGTGATCTTACAGGAAGTGTTGAAGAGCTGGATTTT 
— — I— - — -i I — 1 1 1 h— 

ACTCGGC AGTTCACTAGAATGTCCTTCACAACTTCTCGACCTAAAA 
EPSSDL'TGSVEELDF 

T AAAAC ATT AAATACTTCTGAAGAGACAATTATTGATGAATCTGAT 
1 i 1 1 1 ■ I ■ 1 1 1 i ■ ' h— — i — - 

ATTTTGTAATTTATGAAGACTTCTCTGTTAATAACTACTTAGACTA 
K TLNTSEET I I DESD 

; Hinc II 

GATTTATGAAATAGACCCCCTTTTGAC AAACTATCGTC AAC ACCTT 
1 1 H— — i — . ■ ■ | ■ ; 1 ' i i i i i i 

CTAAAT AC TTTATCTGGGGG AAAAC TGTTTGATAGCAGTTGTGGAA 
I YEIDPLLTNYRQHL^ 
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JHinD III 

CAAGTATGAGGGTGGTTTGGAAGCTTTTTCTCGTGGTTATGAAAAA^ 

1 f— -H 1 1 (— « — , | ., 

GTTCATACTCCCACCAAAC'CTTCGAAAAAGAGCACCAATACTTTTT 
KY E.G GLEAFSR GYE 



K 



Pvu II 



GGGTCCTGGTGCCCAGTCAGCTGCCCTCATTGGAGATTTCAACAAT 
' H 1 1— — i — — H — — h , — | 

CCGAGGACCACGGGTCAGTCGACGGGAGTAACCTCTAAAGTTGTTA 

Ap GAQ S AAL IGD FNN 

CTGGGAGATTTTTCTGCCAAATAATGTGGATGGTTCTCCTGCAATT 
1 H 1 ■ | t 1 ^ H 

GACCCTCTAAAAAGACGGTTTATTACACCTACCAAGAGGACGTTAA 

W EI FLPNNVDGSPAI 

TGTTAAGGATTCCATTCCTGCTTGGATCAACTACTCTTTACAGCTT 
1 1 — — — H 1 I i . . i , . 

AC AATTCCTAAGGTAAGGACGAACCTAGTTGATGAGAAATGTCGAA 
v K D S I P A W I N Y S L Q L 

AGAGGAGAGGTATRTCTTCCAACACCCACGGCCAAAGAAACCAAAG 
— 1 1 ■ I 1 * 1 — — H 1 1 ^ 

TCTCCTCTCCATAYAGAAGGTTGTGGGTGCCGGTTTCTTTGGTTTC 
E ERY?FQHPRPK KPK-/ 
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Xmn I 

GCCTAAAATTAACTCATACGTGAATTTTAGAGATGAAGTTCTTCCT^ 
^ — ~H 1 — 1 1 1 1 1 I 1 i , , , , i , 

CGGATTTTAATTGAGTATGCACTTAAAATCTCTACTTCAAGAAGGA 
P K IN SYVNFRDE VL P 

TCAAGAGCATTCTTATTATGCTAGTTTTGGTTATCATGTCACAAAT 
1 I — — H — ' — i 1 1 . i , , ■ , i ■ 

AGTTCTCGTAAGAATAATACGATCAAAACCAATAGTACAGTGTTTA 
QEHSYYASFGYHVTN 

GTCTTTGATTGATAAAGCTCATGAGCTAGGAATTGTTGTTCTCATG 

' "1" 1 1 I 1 I 1 1 1 1 I I 1 1 1 ' 1 H— 

CAGAAACTAACTATTTCGAGTACTCGATCCTTAACAACAAGAGTAC 
SLIDKAHELGIV VLM 

GAACATGTTTGACGGC ACAGATAGTTGTTACTTTCACTCTGGAGCT 
' 'I 1 1 I 1 1 1 H 1 — ~H h- 

CTTGTAC AAACTGCCGTGTCTATC AACAATGAAAGTGAGACCTCGA 
NMFDGTDSCYFHS GA 

AAACTGGGAGGTACTTAGGTATCTTCTCTCAAATGCGAGATGGTGG 
' 1 1 1 1 1 1 

TTTGACCCTCCATGAATCCATAGAAGAGAGTTTACGCTCTACCACC 
NWE VLRYLLSNAR WW 

ATCAATGATGTATACTCACCACGGATTATCGGTGGGATTCACTGGG 
1 ' 1 ' ■ ■ I ' ■ i -H > — m — . — | — ~ 

tagttactacatatgagtggtgcctaatagccaccctaagtgaccc 
smmythhglsvgftgJ 
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: Hinc II 

TGTGTATCTGATGCTGGTCAACGATCTTATTCACGGGCTTTTCCCA^ 

— —H : 1 i ~H ; 1 — H ]•■•■>• 

ACACATAGA'CTACGACC'AGTTGCTAGAATAAGTGCCCGAAAAGGGT 
VYLMLVNDL I HGLFP 



TTGTATTCCCGTTCAAGATGGGGGTGTTGGCTTTGACTATCGGCTG 

H- ■ | i 1 1 I h- 1 1 1 1 ■ H- 

AACATAAGGGCAAGTTCTACCCCCACAACCGAAACTGATAGCCGAC 
C I PVQDGGVGFDYRL 



GGATGAGGATTGGAGAGTGGGTGATATTGTTCATACACTGACAAAT 
h— H 1 1 ■ ■ ■ ■ | ■ | 

CCTACTCCTAACCTCTCACCCACTATAACAAGTATGTGACTGTTTA 



D 



D W R 



I 



H T L T N 



TCAAGCTCTAGTCGGTGATAAAACTATAGCATYCTGGCTGATGGAC 
1 —H ' I I ■ — — h~ — 

AGTTCGAGATC AGCC ACTATTTTGATATCGTARGACCGACTACCTG 
Q A L V G D 



K 



I 



W 



M D 
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ATT AATAGATCGTGGGATAGCATTGCACAAGATGATTAGGCTTGTA 
■ 1 1 I I I ■ — — - 

TAATTATCTAGCACCCTATCGTAACGTGTTCTACTAATCCGAACAT 
L I O RG I ALHKM IRLV-V 
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; EcoR I 

TGAATTCGGCCACCCTGAGTGGATTGATTTCCCTAGGGCTGARCAA^ 
" 1 H — 1 — 1 1 1 ! 1 i ,.| 

ACTTAAGCCGGTGGGACTCACCTAACTAAAGGGATCCCGACTYGTT 

EF G HPEWID FPRAEQ 

.Ssp I 

TGATAAATGCA GACGGAGATTTGACCTGGGAGATGCAGAATATTTA 

. ' ' ' 1 ' 1 1 1 [ ' ' i j 

ACTATTTACGTCTGCCTCTAAACTGGACCCTCTACGTCTTATAAAT 
DK CRRRFDLGDAEY L 

TGAAGATAA ATATGAGTTTATGACTTCAGAACACCAGTTCATATr a 
"""" 1 ' 1 1 1 1 — I ' H i | . . 

ACTTCTATTTATACTCAAATACTGAAGTCTTGTGGTCAAGTATAGT 
EDI <YEFMTSEHQFIS 



CCTAGTTTTTGT CTTTAATTTTCACTGGACAAATAGCTATTrARAr 

1 1 1 1 1 1 i ' ■ I ■ 1 1 | 

GGATCAAAAACAGAAATTAAAAGTGACCTGTTTATCGATAAGTCTG 
LVfr VFNFHWTNSYSD 



GGACTCAGATGATCCACTTTTTGGTGGCTTCGGGAGAATTGATCAT 

H ' ' 11 ' I i | . . i — | , . 

CCTGAGTCTACTAGGTGAAAAACCACCGAAGCCCTCTTAACTAGTA 
DSD DPLFGGFG RIDH 

YCGYYCAATTATGGTGTATGCACCTAGTAGAACAGCAGTGGTCTAT 

' ' 1 1 I 1 | , ■ , , 1 , . 

RGCRRGTTAATACCACATACGTGGATCATCTTGTCGTCACCAGATA 
R? IMVYAPSRT AVVY 

NGAAGAATTTT 

1 — — 2531 

NCTTCTTAAAA 

E E F 
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70 
71 
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1 



138 
140 
140 

33 

1 



208 
210 
210 
48 



278 

280 

280 

57 

50 



A 6/75 

1 1 r~ 

10 20 30 

fl-GATGGG@CCTTGAACTCAGCAATTTGACACTCAGT 
TjGATGGG-CCTTGAACTCAGCAATTTGACACTCAGT 
T jGATGGGBCCTTGAACTCAGCAATTTGACACTCAGT 
T- 




i — i : j 

80 90 100 

tttttctcttaattccaaccaagg-aatgaataaaa0 
tttttctcttaattccaaccaBggBaatgaataaaag 
tttttctcttaattccaaccaagg - aatg aataaaag 

SaaBag 




150 



160 

_L 



170 



GAAAGATGGTGTATACACTCTCTGGAGTTCGTTTTCC I Fig 10 
GAAAGATGGTGTATAflACTCTCTGGAGTTCGTTTTCC [Sheet 2 
GAAAGATGGTGTATACACTC TCTGGAGTTCGTTTTCC 

iTCfl 




220 230 240 

CAGCAGTAATGGTGATCGGAGGAATGCTAAT0TTTCT 

CAGCAGTAATGGTGATCGGAGGAATGCTAATGTTTCT 

CA GCAGTAATGGTGATCGGAGGAATGCTAATGTTT CT 
CA l 

1g6atgctaatgtttct 




290 



— i — 

300 



310 




atcttggctgaaaagtcttcttacaattccgaathcc 
atcttggctgaaaagtcttcttacaattccgaattcc 
atcttggctgaaaagtcttcttacaattccgaattcc 
atcttggctgaaaagtcttcttacaattccgaattcc 
atcttggctgaaaagtcttcttacaattccgaat^cc 
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350 
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127 
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418 
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197 
190 



488 
490 
490 
267 
260 



558 
560 
560 
337 
330 



628 
630 
630 
407 
400 



48/75 

1 1 1 

360^, 370 380 

TTGTGCCTGGAAflCCAGAGTGATAGCTCCTCATCCTC 
TTGTGCCTGGAACCCAGAGTGATAGCTCCTCATCCTC 
TTGTGCCTGGAACCCAGAGTGATAGCTCCTCATCCTC 

ttgtgcctggaacccagagtgatagctcctcatcctc 
ttgtgcctggaaBccagagtgatagctcctcatcctc 



430 
j. 



440 



450 

JL 



agaaaattccccagcatcaactgatgtagatagttca 
agaaaattccccagcatcaactgatgtagatagttca 
agaaaattccccagcatcaactgatgtagatagttca 

AGAAAATTCCCCAGCATCAACTGATGTAGATAGTTCA 
AGAAAATTCCCCAGCATCAACTGATGTAGATAGTTCA 



500 



510 



520 



AACGATGACGTTGAGCCGTCAAGTGATCTTACAGGAA 
AACGATGACGTTGAGCCGTCAAGTGATCTTACAGGAA 
AACGATGACGTTGAGCCGTCAAGTGATCTTACAGGAA 
AACGATGACGTTGAGCCGTCAAGTGATCTTACAGGAA 
AACGATGACGTTGAGCCGTCAAGTGATCTTACAGGAA 



570 

JL 



580 



590. 



AACTACAAGAAGGTGGTAAACTGGAGGAGTCTAAAAC 
AACTACAAGAAGGTGGTAAACTGGAGGAGTCTAAAAC 
AACTACAAGAAGGTGGTAAACTGGAGGAGTCTAAAAC 
AACTACAAGAAGGTGGTAAACTGGAGGAGTCTAAAAC 
AACTACAAGAAGGTGGTAAACTGGAGGAGTCTAAAAC 



640 



650 



660 



ATCTGATAGGATCAGAGAGAGGGGCATCCCT.CC ACCT 
ATCTGATAGGATCAGAGAGAGGGGCATCCCTCCACCT 
ATCTGATAGGATCAGAGAGAGGGGCATCCCTCCACCT 
ATCTGATAGGATCAGAGAGAGGGGCATCCCTCCACCT 
ATCTGATAGGATCAGAGAGAGGGGCATCCCTCCACCT. 
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698 
700 
700 
477 
470 



768 
770 
770 
547 
540 



838 
839 
840 
617 
610 



908 
909 
910 
687 
680 



978 
979 
980 
757 
750 



710 



720 



730 



CTTTTGACAAACTATCGTCAACACCTTGATTACAGGT 
CTTTTGACAAACTATCGTCAACACCTTGATTACAGGT 
CTTTTGACAAACTATCGTCAACACCTTGATTACAGGT 
CTTTTGACAAACTATCGTCAACACCTTGATTACAGGT 
CTTTTGACAAACTATCGTCAACACCTTGATTACAGGT 



780 



790 
j. 



800 



ACAAGTATGAGGGTGGTTTGGAAGCTTTTTCTCGTGG 
ACAAGTATGAGGGTGGTTTGGAAGCflTTTTCTCGTGG 
ACAAGTATGAGGGTGGTTTGGAAGCfflTTTTCTCGTGG 
ACAAGTATGAGGGTGGTTTGGAAGCTTTTTCTCGTGG 
ACAAGTATGAGGGTGGTTTGGAAGCTTTTTCTCGTGG 



850~ 



860 



870 

JL 



AGGTATCACTTACCGTGAGTGGGCTCCTGGTGCCCAG 
AGGTATCACTTACCGTGAGTGGGCTCCTGGTGCCCAG 
AGGTATCACTTACCGTGAGTGGGCTCQTGGTGCCCAG 
AGGTATCACTTACCGTGAGTGGGCTCCTGGTGCCCAG 
AGGTATCACTTACCGTGAGTGGGCTCCTGGTGCCCAG 



920 930 940 

GACGCAAATGCTGACflTTATGACTCGGAATGAATTTG 
GACGCAAATGCTGACATTATGACTCGGAATGAATTTG 
GACGCAAATGCTGACATTATGACTCGGAATGAATTTG 
GACGCAAATGCTGACATTATGACTCGGAATGAATTTG 
GACGCAAATGCTGACATTATGACTCGGAATGAATTTG 



990 



1000 



1010 
I 
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ATGGTTCTCCTGCAATTCCTCATGGGTCCAGAGTGAA 
ATGGTTCTCCTGCAATTCCTCATGGGTCCAGAGTGAA 
ATGGTTCTCCTGCAATTCCTCATGGGTCCAGAGTGAA 
ATGGTTCTCCT-GCAATTCCTCATGGGTCCAGAGTGAA 
ATGGTTCTCCTGCAATTCCTCATGGGTCCAGAGTGAA^ 
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1048 

1049 

1050 

827 

820 



1 1 18 
1 1 19 
1 120 
895 
890 



1 188 
1 189 
1 190 
965 
960 



1258 
1259 
1260 
1035 
1030 



1328 
1328 
1329 
1 104 
1099 



1060 



1070 



1080 



ttccattc'ctgcttggatcaactactctttacagctt 
ttccattcctgcttggatcaactactctttacagctt 
ttccattcctgcttggatcaactactcttta-cagctt 
ttccattcctgcttggatcaactajctcBHtacagctt 
ttccattcctgcttggatcaactactctttacagctt 



1 130 

_L 



1 140 



1 150 



GATCCACCCGAAGAGGAGAGGTATATCTTCCAACACC 
GATCCACCCGAAGAGGAGAGGTATATCTTCCAACACC 
GATCCACCCGAAGAGGAGAGGTATATCTTCCAACACC 
GATCCACCCGAAGAGGAGAGGTATATCTTCCAACACC 
GATCCACCCGAAGAGGAGAGGTAT0TCTTCCAACACC 



1200 



1210 
x 



1220 



ATGAATCTC ATATTGGAATGAGTAGTCCGGAGCCTAA 
ATGAATCTCATATTGGAATGAGTAGTCCGGAGCCTAA 
ATGAATCTC ATATTGGAATGAGTAGTCCGGAGCCTAA 
ATGAATCTC ATATTGGAATGAGTAGTCCGGAGCCTAA 
ATGAATCTC ATATTGGAATGAGTAGTCCGGAGCCTAA 



■ , , , 

1270 IjSO 1290 ^ 

TCTTCCTCGCATAAAAAA0GCTTGGGTACAATGCG@T 
TCTTCCTCGCATAAAAAA-GCTTGGGTACAATGC.GCT 
TCTTCCTCGCATAAAAAA-GCTTGGGTACAATGCGCT 
TCTTCCTCGCATAAAAAA-GCTTGGGTACAATGCGCT 
TCTTCCTCGCATAAAAAA-@CTTGGGTACAATGCG@T 

-2fe , 



1340 



1350 

JL 



1360 



TGCTAGTTTTGGTTATCATGTCACAAATTTTTTTGCA 
TGCTAGTTTTGGTTATCATGTCACAAATTTTTTTGCA 
BGCTAGTTTTGGTTATCATGTCACAAATTTTTTTGCA 
TGCTAGTTTTGGTTATCATGTCACAAATTTTTTTGCA 
TGCTAGTTTTGGTTATCATGTCACAAATTTTTTTGCA^ 
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1398 
1398 
1399 
11 74 
1 169 



1468 
1468 
1469 
1244 
1239 



1538 
1538 
1539 
1314 
1309 



1410 



1420 



1430 



AAGTCTTTGATTGATAAAGCTCATGAGCTAGGAATTG 
AAGTCTT@GATTGATAAAGCTCATGAGCTAGGAATTG 
AAGTCTTTGATTGATAAAGCTCATGAGCTAGGAATTG 
AAGTCTTTGATTGATAAAGCTCATGAGCTAGGAATTG 
AAGTCTTTGATTGATAAAGCTCATGAGCTAGGAATTG 



1480 
x 



1490 

JL 



1500 



caaataatactttagatggactgaacatgtttgacgg 
caaataatactttagatggactgaacatgtttgacgg 
caaataatactttagatggactgaacatgtttgacQg 
caaataatactttagatggactgaacatgtttgacgg 
caaataatactttagatggactgaacatgtttgacgg 



1550 
x 



1560 
x 



1570 
x 



TGGTTATCATTGGATGTGGGATTflCCGCCTCTTTAAC 
TGGTTATCATTGGATGTGGGATTflCCGCCTCTTTAAC 
TGGTTATCATTGGATGTGGGATTCCCGCCTCTTTAAC 
TGGTTATCATTGGATGTGGGATTCCCGCCTflTTTAAC 
TGGTTATCATTGGATGTGGGATTCCCGCCTCTTTAAC 



1620 



1630 



1 640 
x 
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1608 TCAAATGCGAG'ATGGTGGTTGGATGAGTTCAAATTTG 
1607 TCAAATGCGAGATGGTGGTTGGATGAGTTCAAATTTG 

1609 TCAAATGCGAGATGGTGGTTGGATGBGTTCAAATTTG 
1384 TCAAATGCGAGATGGTGGTTGGATGAGTTCAAATTTG 
1379 TCAAATGCGAGATGGTGGTJGGATGAGTTCAAATTTG 

1 1 1 

1690 1700 1710 

1678 TGTgTACTCACCACGGATTATGGGTGGGATTCACTGG 
1677 TGTATACTCACCACGGATTATCGGTGGGATTCACTGG 

1679 TGTATAflTCACCACGGATTATCGGTGGGATTCACTGG 
1454 TGTATACTCACCACGGATTATCGGTGGGATTCACTGG 
1.449 TGTATACTCACCACGGATTATCGGTGGGATTCACTGG j 
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1760 1770 1780 

1748 TGTGGATGCTGTTGTGTATCTGATGCTGGTCAACGAT 
1747 TGTGGATGCTGTTGTGTATCTGATGCTGGTCAACGAT 

1749 TGTGGATGCTGTTGTGTATCTGATGCTGGTCAACGAT 
1 524 TGTGGATGCTGTTGTGTATCTGATGCTGGTCAACGAT 
1519 TGTGGATGCTGTTGTGTATCTGATGCTGGTCAACGAT 



1830 1840 1850 

1818 ATTGGTGAAGATGTTAGCGGAATGCCGACATTTTGTg 
1817 ATTGGTGAAGATGTTAGCGGAATGCCGACATTTTGTA 

1819 ATTGGTGAAGATGTTAGCGGAATGCCGACATTTTGTA 
1 594 ATTGGTGAAGATGTTAGCGGAATGCCGACATTTTGTA 
1 589 ATTGGTGAAGATGTTAGCGGAATGCCGACATTTTGTA 



1888 
1887 
1889 
1664 
1659 



1958 
1957 
1959 
1734 
1729 



1900 



1910 



1920 



ATCGGCTGCATATGGCAATTGCTGATAAATGGATTGA 
ATCGGCTGCATATGGCAATTGCTGATAAATGGATTGA 
ATCGGCTGCATATGGCAATTGCTGATAAABGGATTGA 
ATCGGCTGCATATGGCAATTGCTGATAAATGGATTGA 
ATCGGCTGCATATGGCAATTGCTGATAAATGGATTGA 



1970 

JL 



1980 

_L 



1990 



GGGTGATATTGTTCATACACTGACAAATAGAAGATGG 
GGGTGATATTGTTCATACACTGACAAATAGAAGATGG 
GGGTGATATTGTTCATACACTGACAAATAGAAGATGG 
GGGTGATATTGTTCATACACTGACAAATAGAAGATGG 
GGGTGATATTGTTCATACACTGACAAATAGAAGATGG 
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2040 



2050 



2060 



2028 gatcaagctctagtcggtgataaaactatagcattct 
2027 gatcaagctctagtcggtgataaaactatagcattct 

2029 gatcaagctctagtcggtgataaaactatagcattct 
1804 gatcaagctctagtcggtgataaaactatagcattct 
1799 gatcaagctctagtcggtgataaaactatagcatDct 
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